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Complexity  of  Human  Language  Comprehension 
Eric  Sven  Ristad 


Abstract: 

-  The  goal  of  this  research  has  been  to  understand  the  computational  structure  of 
principle-and-parameter  linguistic  theories:  what  computational  problems  do  these 
theories  pose  and  what  is  the  underlying  structure  of  those  computations?  To  do 
this,  I  have  analyzed  the  computational  problem  of  human  language  comprehension: 
what  linguistic  representation  is  assigned  to  a  given  sound?  This  language  com¬ 
prehension  problem  may  be  factored  into  smaller,  interrelated  (but  independently 
statable)  problems  defined  on  partial  phonological,  morphological,  and  syntactic 
representations.  For  example,  in  order  to  understand  a  given  sound,  the  listener 
must  assign  a  phonetic  form  to  the  sound;  determine  the  morphemes  that  compose 
the  words  in  the  sound;  and  calculate  the  linguistic  antecedent  of  every  pronoun  in 
the  utterance.  I  prove  that  these  and  some  other  subproblems  are  all  NP-hard,  and 
that  language  comprehension  is  itself  PSPACE-hard,  according  to  current  linguistic 
theory. 
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1  Introduction  to  the  Reductions 


If  linguistic  theorv  is  a  theorv  nf  t  lie  human  language  ability.  is  rational  sci¬ 
entists  must  claim,  then  linguistic  theorv  must  have  empirical  consequences. 
Some  of  those  consequences  will  be  computational.  At  the  verv  least,  lan¬ 
guage  users  must  use  the  linguistic  representations  extensionallv  character 
ized  by  the  generative  grammars  of  modern  linguistics.  This  in  and  of  itself 
states  a  class  of  computational  problems:  what  linguistic  representation  is 
assigned  to  a  given  sound,  what  articulation*  ex  ores',  a  given  underlying  lin¬ 
guistic  representation,  and  what  is  i  he  internal  state  of  a  language  learner  af¬ 
ter  a  given  sound?  Each  of  these  problems  -comprehension,  production,  and 
acqi.tctriori—  — ay  be  profit factored  into  smaller,  interrelated  tout  inde¬ 
pendently  statable)  problems  defined  on  partial  Linguistic  representations. 
The  language  comprehension  problem  includes  as  ;■  o-problems  the  phono¬ 
logical  problems  of  determining  stress,  rone,  and  sellable,  morphological, 
and  articulatory  structure;  the  syntactic  problems  of  det"c  ing  •‘inptv  ca’e- 
gories,  disambiguating  words,  deteriruning  phrase  and  argument  >rruitures. 
and  computing  the  linguistic  antecedents  of  anaphora.  More  abstractly, 
these  sub-problems  of  comprehension  consist  of  completing  an  independent  lv 
defined  portion  of  an  incomplete  Linguistic  representation. 

The  goal  of  my  doctoral  research  has  been  to  determine  the  computational 
structure  of  current  linguistic  theory:  what  problems  does  the  theorv  pose 
and  what  is  their  underlying  structure?  To  do  this.  I  have  studied  abstract 
problems  in  language  comprehension  with  computational  complexity  theory, 
a  theory  of  the  absolute  difficulty  of  solving  computation?!  problems,  in 
terms  of  their  natural  parameters.  The  major  technical  result  of  the  thesis 
is  that  these  computational  problems  are  all  intractable  and  cannot  be  solved 
in  practice. 

The  greatest  difficulty  of  the  research  lies  in  <  reating  precise  linguistic  mod¬ 
els  from  the  inconsistent,  incomplet  1  imprecise  linguistics  theories  that 
are  available  today.  These  models  e  sufficiently  abstract  to  captuie 

the  core  ideas  of  modern  linguistics  in  -r  to  remain  relevant  despite  rapid 
change  in  the  field.  My  approach  has  been  to  reduce  the  instances  of  some 
known  problem  P  into  a  class  of  linguistic  phenomena  such  that  under¬ 
standing  that  phenomena  corresponds  to  solving  P.  As  far  as  1  know,  the 
reductions  hold  for  all  current  linguistic  theories  that  explain  these  phenom¬ 
ena.  This  places  these  proofs  among  the  strongest  formal  results  achievable 


in  an  empirical  science. 

This  manuscript  contains  the  first  complexity  results  for  modern  ( 1980s ) 
generative  linguistic  theory,  a  task  thought  by  manv  linguists  to  be  impos¬ 
sible.  Moreover,  the  results  cover  the  entire  spectrum  of  linguistics:  phonol¬ 
ogy.  morphology,  and  each  level  of  syntactic  representation  ( D- structure. 
S-structure,  and  Logical  Form). 
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2  The  Complex  Structure  of  Sounds  and  Words 


Words  are  built  from  units  of  meaning  called  morphemes.  In  the  more  fa¬ 
miliar  languages  of  the  world,  such  as  Romance  languages,  morphemes  are 
concatenated  to  form  words:  this  process  is  ailed  affixation.  In  some  lan¬ 
guages.  such  as  Semitic  languages  where  vowels  are  morphemes,  a  morpheme 
mav  appear  more  that  once  inside  another  morpheme  (this  is  called  infixa¬ 
tion).  For  example,  the  Arabic  word  katab.  meaning  'he  wrote",  is  formed 
from  the  active  perfective  morpheme  a  infixed  to  the  ktb  morpheme. 

In  addition  to  morphemes,  words  consist  of  syllables.  Each  syllable  contains 
one  or  more  vowels  V  (its  nucleus)  that  may  be  preceded  or  followed  by 
consonants  C.  For  example,  katab  consists  of  two  syllables,  the  two-segment 
syllable  CV  and  the  three-segment  c.osed  syllable  C VC. 

Syllable  structure  can  interact  with  morphological  structure.  For  example, 
all  Arabic  words  must  end  in  the  closed  syllable  C’VC.  and  therefore  the  a 
morpheme  must  be  infixed  to  the  ktb  morpheme  in  order  to  form  a  word  with 
permissible  syllable  structure.  It  cannot  be  prefixed,  because  Arabic  does 
not  permit  sy. lables  of  the  form  VCCC;  nor  can  it  be  suffixed,  because  CCCV 
syllables  are  not  permitted.  These  interactions  appear  simple,  but  they 
can  become  extremely  complex  when  morphemes  are  ambiguous  or  when 
segments  are  underspecified  as  in  distorted  speech.  In  fact,  determining 
the  morphological  and  syllable  structure  of  an  underspecified  sound  is  like 
solving  the  satisfiability  problem  for  Boolean  formulas,  where  one-segment 
morphemes  are  Boolean  variables,  vowels  true  literals,  and  consonants  false 
literals.  Then  three-segment  syllables  would  correspond  to  satisfied  3-CNF 
clauses,  and  words  to  entire  3SAT  instances. 

Among  the  world’s  languages,  assimilation  is  one  of  the  most  common 
phonological  processes,  \ssimilation  is  the  process  whereby  some  segment 
comes  to  share  properties  of  another  segment  in  the  course  of  the  derivation 
of  surface  forms  from  underlying  forms.  For  example,  in  English,  conso¬ 
nant  nasality  assimilates  to  immediately  preceding  vowels;  assimilation  also 
occurs  across  morpheme  boundaries,  as  the  varied  surface  forms  of  the  pre¬ 
fix  /in-/  demonstrate:  /in-/  + /logical,  —  -illogical;  and  /in-/  + /probable/ 
—  t improbable /.  In  other  languages,  assimilation  is  unbounded  and  can  af¬ 
fect  nonadjacent  segments:  these  assimilation  processes  cue  called  harmony 
systems.  In  the  Turkic  languages  all  suffix  vowels  assimilate  the  backnesss 
feature  of  the  last  stem  vowel;  in  Capanahua,  vowels  and  glides  that  pre- 


/ 
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cede  a  word-final  deleted  nasal  (an  underlying  nasal  segment  absent  from 
the  surface  form)  are  all  nasalized. 

Harmony  systems  are  powerful  computational  mechanisms.  In  fact,  we  can 
virtually  duplicate  the  preceding  reduction  idea  merely  by  replacing  mor¬ 
pheme  infbcation  with  harmonic  features.  In  the  second  reduction  then, 
harmonic  features  will  represent  Boolean  variables,  and  harmony  processes 
will  ensure  consistent  truth  assignments  to  the  Boolean  variables;  the  other 
details  are  essentially  unchanged. 

2.1  Word  Recognition  in  the  Nonlinear  Model 

In  phonological  theory,  segregated  representations  obey  simple  principles: 
this  section  examines  the  complexity  of  these  interacting  representations 
(planes  and  tiers)  and  morpheme  combination  (affixation  and  infbcation). 
In  order  to  do  this,  we  pose  the  word  recognition  problem; 

Word  Recognition  Problem  (WORD) 

Given  an  underspecified  timing  slot  vector  V,  and  a 
morphological  model  Af,  is  V  a  permissible  word  according  to 
Af? 

The  timing  slot  vector  is  an  abstract  representation  of  a  speech  sound  in 
terms  of  the  articulations  (phonemes)  necessary  to  produce  that  sound.  It 
is  underspecified  when  articulations  necessary  to  produce  a  word  of  the 
language  are  missing  from  the  representation. 

Theorem  1  WORD  is  NP-hard  in  the  rule- free  nonlinear  morphological 
model. 

Proof.  The  proof  is  by  a  reduction  from  3SAT  to  WORD.  The  reduction 
input  is  a  3-CNF  formula  F  containing  clauses  C i,  C 2, . . .  ,CP  in  the  Boolean 
variables  xx,  xj, .  ■  •  ,<rn.  Each  clause  contains  exactly  three  distinct  literals 
labeled  by  C,  =  (o^  V  6;  V  c,).  The  output  is  a  rule-free  nonlinear  model  M 
(list  of  morphemes,  feature  geometry,  and  syllable  templates)  and  an  under- 
specified  timing  slot  vector  V ,  such  that  V  is  a  permissible  word  according 
to  M  if  and  only  if  F  is  satisfiable. 


Each  variable  r,  is  represented  by  a  one- segment  morpheme  jx,.  and  each 
clause  Cj  is  represented  by  a  three-segment  syllable  ct}.  The  reduction  con¬ 
structs  the  morphemes  g-xP-z  ■  ■  ■  pn,  plus  two  features:  [rTRUE]  to  represent 
the  truth  value  assigned  to  a  variable,  and  [xSEG]  to  represent  whether  a 
literal  is  negated  or  not.  Segments  specified  with  the  features  [+SEG,  -TRUE] 
or  [-IEG.+TRUE]  represent  true  literals,  which  correspond  to  vowels.  For 
example,  the  false  literal  r*  is  represented  by  the  morpheme  with  the 
feature  vector  [+SEG,  +TRUE] . 

Syllable  templates  ensure  that  each  3-CNF  clause  contains  one  or  more 
true  literals  by  requiring  syllables  to  span  three  segments  and  contain  one 
or  more  true  literals  (FFT.  FTF.  FTT.  TFF.  TFT.  TTF,  TTT).  In  other 
words,  we  are  simply  excluding  FFF.  one  of  eight  possible  three-segment 
'truth  tempiates,'  which  corresponds  to  positing  a  language  where  syllables 
span  exactly  three  segments  and  must  include  a  vowel. 

The  timing  slot  vector  V'  contains  the  string  of  formula  literals,  where  each 
formula  literal  is  represented  by  a  morpheme  and  the  appropriate  setting  of 
the  IEG  feature. 

1  -  n  distinct  planes  are  needed:  a  single  one- tiered  plane  for  each  morpheme 
qua  variable,  and  one  two-tiered  plane  for  syllables  qua  satisfied  3-CNF 
clauses.  The  reduction  requires  0(n)  time  and  space  to  specify  the  nonlinear 
model  and  does  not  use  any  morphological  rules,  which  strongly  suggests 
that  the  nonlinear  model  is  more  powerful  than  ,VP.Q 

2.2  Sound  Recognition  in  the  Nonlinear  Model 

Sound  Recognition  Problem  (SOUND) 

Given  an  underspecified  timing  slot  vector  V,  and  a 
phonological  model  M,  is  V  a  permissible  phonological 
representation  according  to  Ml 

Theorem  2  SOUND  is  NP-hard  in  (he  rule-free  nonlinear  phonological 
model. 

Proof.  The  proof  is  by  reduction  from  3SAT  to  SOUND,  and  follows 
the  preceding  proof  of  theorem  1  in  its  idea.  Phonological  harmony  will 
replace  morpheme  infixation  as  the  linguistic  process  that  ensures  consis¬ 
tent  Boolean  truth  assignments.  As  before,  the  reduction  input  is  a  3-CNF 


3-clause  -  3-clause  - 


ic,-true; 


Figure  1:  Nonlinear  morpho-phonoiogical  representation  of  the  3SAT  formula  /  = 
(a  V  b  V  c)  A  (a  V  6  V  c),  satisfied  by  a.b  true  and  c  false.  The  timing  slot  vector, 
drawn  horizontally  in  the  center  of  the  figure  with  a  heavy  line,  represents  the 
string  of  formula  literals.  Each  of  the  three  one-segment  morphemes,  drawn  below 
the  timing  slot  vector  with  noncrossing  association  lines,  resides  on  its  own  plane 
and  represents  a  formula  variable.  The  final  two  tiers,  drawn  above  the  timing  slot 
vector,  represent  satisfied  3-CNF  clauses:  one  T,F-tier  for  literals  and  a  second  tier 
for  satisfied  3-CNF  clauses. 
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formula  F  and  the  output  is  a  <  partial  I  rul^-fr*-**  nonLiiu-ar  phonological 
model  M  (rooted  feature  geotnetrv.  -v liable  templates,  and  li-t  of  harmuriv 
processes  I  and  an  underspecified  riming  slot  vec'or  l  .  mkIi  that  1  is  a 
permissible  phonological  representation  according  to  M  if  and  onlv  if  F  is 
satisftabie. 

Each  variable  x,  is  represented  hv  a  fdnarv  piace-of  articulation  feature  p 
the  truth  assignment  to  x.  is  encoded  ,o  the  biiwuv  harmonic  .art uulatorv 
feature  /,:  and  each  clause  C  is  represented  hv  a  syllable  c  spanning  three 
segments.  The  reduction  also  constructs  the  binarv  feature  [  ;  NEG]  to  rep¬ 
resent  whether  a  literal  is  negated  or  not.  Each  harmonic  feature  /,  is  imme¬ 
diately  dominated  by  a  place-of-articulation  feature  />. .  which  is  connected 
to  the  root.: 


Segments  specified  with  the  features  [p.  l ,  /, 0 ,  +BEG]  or  [p.  1 ,  /.  1 , -KEG]  rep¬ 
resent  true  literals  of  the  variable  x.,  and  will  correspond  to  vowels.  For  ex¬ 
ample,  the  false  literal  ?7 is  represented  by  the  feature  matrix  [p4 1 ,  /,  1 ,  +SEG] 

Being  harmonic,  each  feature  /,  will  receive  the  same  value  throughout  the 
timing  slot  vector.  As  before,  syllable  templates  ensure  that  each  3-C'NF 
clause  contains  one  or  more  true  literals  by  requiring  syllables  to  span  three 
segments  and  contain  a  true  literal.  The  timing  slot  vector  V’  contains 
the  string  of  formula  literals,  where  earb  formula  literal  is  represented  by  its 
place  of  articulation  feature  n.  and  the  appropriate  setting  of  the  SEG  feature. 
For  example,  the  negated  Literal  77  i  represented  bv  the  underspecmea 
feature  matrix  ip*  1 ,  +SEG]  .j  j 


Comments.  The  reductions  to  the  nonlinear  model  are  blocked: 


If  planar  interaction  is  limited  to  superposition  of  tier  features.  In  this 
highly  restricted  nonlinear  model,  each  articulatory  feature  can  he 
associated  with  at  most  one  plane:  planes  are  trulv  computational] v 
independent  because  they  interact  only  in  that  they  each  may  affect 
the  acoustic  signal.  But  this  nonlinear  superposition  model  i»  far  too 
impoverished  for  natural  language,  where  all  phonological  processes 
have  access  to  one  small  fixed  set  of  articulators  and  hence  must  make 
maximal  use  of  their  representational  possibilities  (McCarthy,  1981 1. 
For  example,  consonants  and  vowels  are  segregated  into  distinct  c.v 
planes  and  are  representat ionallv  distinguished  by  the  non- phonetic 
consonant  feature.  But  consonants  and  vowels  must  both  use  the 
same  set  of  phonetic  features  in  articulation.  They  are  distinguished 
in  articulation  only  by  the  obstruction  or  nonobstruction  of  the  air 
flowing  from  the  lungs  to  the  lips.  (Consonants  require  an  articulator 
to  contact  the  stationary  part  of  the  vocal  tract.)  But  “this  striking 
difference  between  the  production  of  vowels  and  consonants  must  not 
be  allowed  to  obscure  the  obvious  fact  that  for  the  production  of  both 
types  of  sounds  speakers  have  at  their  disposal  only  a  single  piece  of 
anatomical  machinery,  the  vocal  tract  with  its  six  articulators.  . 
(Halle.  1987'. 

(  Nonetheless,  the  idea  of  the  second  reduction  is  problematic,  because 
everv  segment's  consonant  feature  is  directly  affected  by  some  har¬ 
monic  feature  in  that  segment.  I  do  not  know  of  any  harmony  processes 
wit  h  t  his  character. ) 

In  the  reductions,  the  timing  slot  vector,  which  is  an  abstract  rep¬ 
resentation  of  the  acoustu  signed,  contains  sufficient  information  to 
identify  each  literal  morpheme  bdace  of-articulation.  while  systemat¬ 
ically  omitting  the  feature  that  encodes  truth  assignment.  At  first 
glance  this  is  plausible  '■  cause,  as  is  well  known,  human  speech  is 
fantastically  resistant  to  di-tort  ion.  no  matter  how  underspecified  the 
acoustic  signal  may  he. 

One  way  to  block  the  rennet  ions  would  he  !u  require  the  nonlinear 
representation  to  hr  stable  with  small  changes  in  perceived  phonetic 
features.  As  things  stand,  this  is  not  a  property  of  the  preceding 
reductions,  which  would  fail  because  we  could  not  ensure  segregation 
of  formula  variables  or  consistency  of  truth  assignments  in  t  he  presence 
of  rioi-r  We  might  meet  this  stability  requirement  by  using  more 
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phonetic 

values. 


features,  to  redundant lv  identify 


variable-,  and  their  truth 


•  We  might  disallow  repeated  morpheme  inlixation  or  universally  hound 
the  number  of  harmony  processes  bv  a  -mall  constant.  Both  infixa- 
tion  and  harmony  are  bounded  in  known  languages:  no  language  is 
thought  to  have  more  than  two  harmonv  processes,  and  morpheme 
infixation  appears  to  be  limited  to  infixing  one  vowel  per  morpheme. 
Unfortunately,  there  is  no  elegant  wav  to  explain  boundedness  beyond 
mere  stipulation  in  an  approach  to  linguistic  theorv  that  distinguishes 
knowledge  and  ability,  as  generative  linguistics  does. 


2.3  Postmortem 

The  aim  of  the  preceding  analysis  was  to  determine  the  complexity  of  planar 
interactions,  harmony  processes,  and  morpheme  combination  in  the  non¬ 
linear  model.  The  reduction  very  clearly  shows  that  the  simplest  planar 
interactions  are  complex,  regardless  of  (the  potential  complexity  of)  both 
interplanar  computations,  such  as  stress  shift,  and  determining  the  corre¬ 
spondence  between  surface  and  underlying  forms.  With  respect  to  mor¬ 
pheme  infixation  and  phonological  harmony,  we  can  see  that  both  processes 
are  costly  when  coupled  with  articulatory  underspecification. 
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3  Structure  of  transformational  models 

The  number  of  levels  of  linguistic  representation,  their  qualities,  and  the 
rules  relating  them  are  central  concerns  in  generative  linguistics.  Within  the 
principles  and  parameters  approach,  syntactic  theories  have  been  broadly 
classified  both  bv  the  levels  of  representation  they  employ  (S-structure  and 
PF.  and  sometimes  D-st  .cture.  Np-structure.  and/or  LF)  and  by  the  rule 
determining  the  relation  between  a  trace  and  its  antecedent  (move-alpha 
versus  'rules  of  construal,'  such  as  Rizzi’s  chain  formation  algorithm.  Kayne's 
g-projection,  or  Koster’s  dynasty  formation). 

Transformational  theories  include  D-struccure  and  S-structure  represents 
tions,  and  hold  that  S-structure  is  derived  from  D-structuxe  by  successive 
move-alpha  transformations,  where  each  application  of  move-alpha  uniquely 
relates  a  trace  to  its  antecedent,  and  where  all  relations  betwe<  n  traces  and 
their  antecedents  determined  by  such  a  derivation  are  represented  at  S- 
structure.1 

3.1  Empirical  properties  of  move-a 

It  has  been  difficult  to  find  empirical  arguments  favoring  the  transforma¬ 
tional  approach  over  competing  ‘representational'  approaches.  Arguments 
based  on  exhibiting  natural  properties  of  D-structure  that  are  difficult  to 
state  at  S-structure  have  not  been  entirely  persuasive  because  S-structure 
(redundantly)  represents  all  information  represented  at  D-structure.  due  to 
trace  theory  and  the  projection  principle.  Thus,  natural  properties  of  D- 
structure  will  also  be  naturally  >tatable  at  S-structure  in  terms  of  traces. 

Arguments  for  transformational  theories  based  directly  on  empirical  proper¬ 
ties  of  move- alpha  derivations  have  been  the  most  persuasive,  simply  because 
theories  lacking  move-alpha  derivations  will  lack  those  empirical  properties. 
Such  arguments  exhibit  properties  of  move-alpha  or  entire  derivations  that 
are  difficult  to  state  as  S-structure  rules  of  construal  between  traces  and 

'Modern  transformational  theories  are  often  misleadingly  named  'derivational'  in  con¬ 
trast  to  the  competing  'representational'  approaches.  These  widely-used  appelatmns  are 
misleading  because  all  generative  linguistic  theories  admit  explicitly  specified  represen¬ 
tations,  that  is,  representations  denned  according  to  some  effective  procedure.  The  only 
distinquishing  issues  are  the  role  of  move-alpha  m  the  derivation  of  syntactic  representa¬ 
tions,  and  the  status  of  D-structure,  NP-structure,  and  LF  in  the  theory. 
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their  potential  binders.  For  example,  in  his  Fall  19*7  class  lectures  (hom 
sky  argued,  with  empirical  evidence,  that  derivations  must  obev  a  ’least 
effort’  principle:  that  is,  the  number  of  derivation  steps  required  to  relate 
a  trace  to  its  S-structure  binder  rrui'f  be  minimal.  Such  global  pr<  perties 
of  derivations  are  thought,  withou'  proof  or  supporting  argument,  to  be 
difficult  to  express  as  S-structure  rules  of  construal. 

Another  powerful  argument  for  move-alpha  is  that  the  conditions  on  move- 
alpha  can  he  satisfied  neither  at  D-struct  tire  nor  S  structure,  but  ortlv  at 
an  intermediate  stage  of  the  derivation.  1  his  obtains  whenever  an  appli¬ 
cation  of  move-alpha  disturbs  the  obligator*  structural  rela*  ''>r  tvpicallv, 
c-command)  between  a  previously  moved  constituent  and  its  trace,  as  when 
a  constituent  J  containing  a  trace  f,  is  moved  outside  tiie  c-command  domain 
of  its  antecedent  a,,  as  in  ( l ). 

U) 

a.  ... 

b.  ...  j ...  <>,  ■■■t j 

Then  the  relation  between  the  trace  t,  and  its  binder  o,  will  not  obev  the 
same  structural  conditions  at  S-structure  as  other  traces  and  their  binders 
do.  In  (lb),  t,  is  neither  c  commanded  nor  bound  by  its  antecedent  at  S- 
structure.  yet  the  resulting  S-stnicture  is  surely  permissible,  as  in  passive 
VP  topicalization  structures: 

^  *  a.  John,  has  never  been  vp  arrested  t,  by  the  FBI 

vp  arrested  f,  by  the  FBI  j  John,  has  never  been  tj 

(The  complexity  proofs  below  will  relv  on  this  type  ot  "rolling  movement’’ 
construction.)  The  choice  between  transformational  and  alternate  theories 
ultimately  reduces  to  the  necessity  of  representations  intermediate  between 
D-structure  and  S-structure:  if  intermediate  representations  are  necessary, 
as  has  been  argued  in  the  literature,  then  syntactic  representations  must 
include  ordered  transformational  derivations. 

3.2  Conceptual  objection  to  transformations 

Transformational  grammars  (TGs)  >eek  to  explicitly  relate  linguistic  struc¬ 
tures  via  a  sequence  of  transformations.  For  example,  in  transformational 
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theories,  a  transformation  relates  an  active  sentence  to  its  passive  variant. 
But  there  is  a  major  conceptual  problem  with  this  approach  to  linguistic 
theory:  TGs  do  not  construct  explicit  representations.  In  a  TG,  linguistic 
relations  are  implicitly  encoded  both  in  the  underlying  and  surface  forms 
and  in  the  series  of  transformations  required  to  derive  the  surface  form  from 
its  underlying  form.  In  effect,  transformations  serve  double  duty,  both  as  a 
linguistic  representation  and  as  the  derivational  rules  of  a  model  of  compu¬ 
tation. 

For  example,  in  transformational  grammars  elements  are  nonlocally  related 
in  a  derivation  of  syntactic  forms  using  the  unbounded  one-to-one  move-a 
transformation,  locally  bounded  by  linguistic  principles  such  as  the  empty 
category  principle  (ECP)  and  chain  condition.  The  representations  assigned 
by  these  theories  (chains)  include  intermediate  constituents  (traces)  whose 
only  apparent  role  is  to  allow  the  iterated  local  representation  of  nonlocal 
relations.  Intermediate  traces  represent  only  the  history  of  the  computation 


of  nonlocal  relations.2 


As  a  consequence  of  this  failure  to  separate  representation  and  the  process 
whereby  it  is  constructed,  important  linguistic  relations,  such  as  the  nonlocal 
antecedent-trace  relation,  are  obscured  by  the  transformational  svstem.  This 
is  also  the  reason  why  the  transformational  component  of  current  linguistic 
theories  (move-alpha)  bears  so  no  resemblance  to  the  rest  of  the  theory  (that 
is,  to  phrase  and  argument  structure;  to  case,  control,  binding,  and  theta 
relations:  or  to  the  lexicon). 

Why  should  a  linguistic  theory  seek  to  hide  the  structure  of  linguistic  rep¬ 
resentations  in  order  to  explicitly  encode  the  derivational  relations  between 
those  representations?  This  corresponds  to  statement  of  the  .'IS  AT  prob¬ 
lem  in  terms  of  truth  assignments  and  rules  that  derived  all  possible  3SAT 
instances  satisfied  by  a  common  truth  assignment  from  that  given  truth 
assignment.  What  would  be  the  point?  The  best  statement  of  a  problem 

"These  intermediate  traces  are  the  result  of  a  particular  conception  of  the  ECP  as  a  local 
bound  on  move-a.  I  would  be  more  persuaded  of  their  existence  if  they  interacted  with 
other  grammatical  components.  For  example,  they  might  have  binding  or  phonological 
effects.  Adjunct  traces  may  satisfy  the  EC  P  only  via  antecedent  government  at  LF;  as 
a  consequence,  adjunct  extraction  results  in  intermediate  traces  that  may  not  be  deleted 
at  S-structure.  Thus,  the  only  intermediate  traces  required  at  S-structure  are  the  traces 
of  adjunct  extraction,  but  these  non-case-marked  traces  do  not  block  u>ant~to  —  manna 
contraction,  which  is  only  blocked  by  case-marked  elements  (Chomsky  1 986a:  1 62 f.  For 
example: 

(3)  how  do  you  wanna  solve  the  problem? 

As  expected,  the  intermediate  traces  in  specifier  of  CP  and  adjoined  to  VP  do  not  block 
phonological  contraction.  Neither  do  these  intermediate  A-traces  affect  binding  relations, 
whose  domain  is  NPs  in  A-positions. 

(4)  which  woman),  did  John  )vp  dream 

,cp  .ip  Bill  [  1°  vp  .vp  saw  |  .  herself  |  pp  wlth  l<. 

The  governing  category  of  the  direct  object  is  IP  (the  complete  functional  complex),  and 
therefore  the  c-commanding  trace  t‘  adjoined  to  VP  could  bind  the  anaphor  in  object 
position  within  its  governing  category,  if  the  trace  were  in  an  A-position.  But,  as  ex¬ 
pected,  kerself  is  in  fact  unbound,  which  strongly  suggests  that  f,1  is  only  relevant  to  the 
computation  of  nonlocal  A-movement  as  constrained  by  the  ECP.  The  precise  formulation 
of  the  ECP,  and  the  existence  of  the  intermediate  traces  it  requires,  is  the  topic  of  much 
active  research  and  debate  But  the  fart  that  these  intermediate  traces  do  not  enter  into 
other  syntactic  relations  casts  doubt  on  their  explicit  syntactic  representation,  at  least  in 
my  mind. 


is  the  most  direct  characterization  of  problem  instances;  relations  among 
problem  instances  are  of  secondary  concern.  I  believe  that  the  primary  goal 
of  linguistic  theory  should  be  explicit  representation.  Relations  among  those 
representations,  which  are  of  secondary  concern,  will  be  self-evident  through 
the  structures  they  share  (by  accident,  as  it  were). 
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4  The  Complex  Structure  of  Sentences 


The  syntactic  structure  underlying  even  simple  constructions  can  be  quite 
intricate.  For  example,  according  to  current  syntactic  theory,  the  simple 
passive  sentence  John  was  seen  is  assigned  a  complex  syntactic  structure, 
whose  phrase  structure  component  follows: 

(5)  jp  NP  John  ,  was  vp  see^en  t , 

Most  interestingly,  each  word  interacts  strongly  with  every  other  word  in 
the  sentence.  The  passive  verb  see~en  selects  the  underlying  object  t,  and 
assigns  it  a  ‘patient’  thematic  role;  the  underlying  object  appears  as  the 
surface  subject  John ;  the  suhject  agrees  with  the  auxiliary  verb  teas,  which 
assigns  it  nominative  case:  and,  to  complete  the  circle  of  interactions,  the 
auxiliary  verb  selects  the  passive  verb.  These  properties  of  words,  such 
as  case-marking,  thematic  role  assignment,  selection  and  agreement,  are  all 
independent,  not  directly  deducible  from  the  phonological  form  of  the  words, 
and  potentially  correlated  in  the  lexicon. 

It  is  easy  to  see  that  interactions  among  the  words  in  a  sentence  can  po¬ 
tentially  become  extremely  complex.  For  example,  imagine  if  the  lexicon 
contained  two  slightly  different  verbs,  see\  and  see 2.  with  the  same  phono¬ 
logical  form  (homophones)  but  different  selectional  restrictions.  Then  verb 
phrases  could  encode  satisfied  2-CNF  clauses:  seej  would  be  false  and  se¬ 
lect  a  true  argument:  conversely,  see2  would  be  true  and  select  either  a 
true  or  false  argument.  The  consequence  is  that  any  verb  phrase  headed 
by  see  would  contain  a  word  representing  a  true  literal.  We  could  even  get 
two  literals  of  the  same  variable  to  agree  on  truth  value  by  moving  one  to 
the  subject  position  of  the  other,  where  they  must  agree,  exactly  as  in  the 
passive  construction:  the  underlying  object  moves  to  the  subject  position, 
where  it  must  agree  with  the  auxiliary  verb.  Then  if  words  were  Boolean 
literals,  it  might  even  be  possible  to  encode  3SAT  instances  in  sentences. 


5  Complexity  of  Move-tv 

Transformational  derivations  have  played  a  central  historical  role  in  gener¬ 
ative  linguistics.  In  each  case,  transformational  models  have  been  proven 
intractable  ( Peters  and  Ritchie,  1973:  Rounds,  1975).  To  investigate  whether 
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current  transformational  models,  based  on  the  generalized  move-a  transfor¬ 
mation.  share  the  computational  characteristics  of  their  ancestors,  we  define 
a  natural  problem  posed  by  our  language  ability  and  determine  its  com¬ 
plexity  according  to  the  linguistic  model.  Consider  the  lexical  resolution 
problem,  a  subproblem  of  language  perception: 

Lexical  Resolution  Problem  (LRP) 

Given  an  S-structure  5  with  ambiguous  or  underspecified 
words,  and  lexicon  L  containing  ambiguous  words,  can  the 
words  in  5  be  found  in  the  lexicon  LI 

Theorem  3  The  LRP  is  SP-hard  in  transformational  models  with  move-a 


Proof.  The  proof  is  by  a  reduction  from  3SAT  to  LRP;  it  begins  with 
a  sketch  of  the  proof  idea  along  with  a  simple  example.  I  then  state  the 
essential  characteristics  of  the  transformational  models  necessary  for  the 
reduction  to  succeed. 

The  input  to  the  reduction  is  a  3-CNF  formula  F  containing  clauses  Ct,  Cj, 
. . Cp  in  the  Boolean  variables  xi,  X2,  ■  ■  ■ ,  £n-  Each  clause  contains  exactly 
three  distinct  literals  labeled  by  C\  =  (a,  V  b,  V  c,).  The  output  is  a  lexicon 
L  and  an  S-structure  5  containing  underspecified  words  such  that  the  words 
in  5  can  be  resolved  found  in  L  if  and  only  if  F  is  satisfiable. 

The  first  step  is  to  create  a  D-structure  that  represents  F,  where  the  rth  con¬ 
stituent  represents  the  ith  formula  literal  and  selects  the  i  +  1th  constituent 
(see  figure  2).  The  selectional  properties  of  constituents  will  ensure  that 
each  3-clause  contains  at  least  one  true  literal,  although  lexical  ambiguity 
will  prevent  us  from  knowing  which  literals  in  the  3-clause  are  true.  To  do 
this,  the  first  literal  a,  of  a  3-clause  C,  must  promise  to  make  the  3-clause 
true,  either  by  being  true  itself  or  by  selecting  a  literal  that  promises  to  make 
the  3-clause  true;  to  fulfill  its  promise,  the  second  literal  6,  must  either  be 
true  or  select  a  true  literak 


a,  true 
a,  false 


a,  true 

b,  false - «.  true 


(If  a,  is  true,  it  selects  the  next  literal  6,  with  any  truth  value.) 


/ 
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Given  variable  truth  values,  affixes  listed  in  the  lexicon  determine  literal 
truth  values;  they  will  negate  or  preserve  variable  truth  values,  according  to 
F  (see  below). 

Then,  scanning  from  right  to  left,  each  literal  moves  to  the  specifier  position 
of  the  closest  literal  of  the  same  variable,  either  by  long  movement  or  by 
successive  cyclic  movement  (see  figures  3,  1). 

In  the  resulting  S-structure,  the  specifier  position  of  the  uh  occurence  of  a 
variable  contains  the  i  +-  1th  occurence  of  the  same  variable,  and  agrees  with 
it  by  specifier- head  agreement:  all  but  the  first  occurence  of  a  variable  are 
contained  in  the  specifier  position  of  the  first  occurence  of  that  variable  (see 
figure  4). 

Now,  by  specifier-head  agreement  at  S-structure.  all  variables  have  consis¬ 
tent  truth  assignments,  and  by  D-structure  selection  all  clauses  contain  a 
true  literal,  where  negation  is  performed  by  affixes.  The  formula  is  satisfiable 
if  and  only  if  the  corresponding  D-  and  S-structure  are  well-formed.]  [ 

Requirements  on  the  transformational  model,  tacn  formula  literal 
is  represented  by  a  constituent  with  the  following  characteristics: 

1.  Transparent  to  extraction.  The  construction  must  permit  successive 
cyclic  movement  (typically  adjunction)  between  bounding  nodes  in 
order  to  satisfy  the  subjaceny  condition  of  bounding  theory. 

2.  Contains  a  landing  site  that  agrees  with  the  constituent  head.  The  con¬ 
stituent  will  contain  a  specifier  position;  the  head  of  the  constituent 
will  agree  with  the  specifier  position  and  assign  case  to  it;  the  land¬ 
ing  site  will  be  limited  to  literals  of  the  same  variable  by  identifying 
agreement  features  on  the  head. 

3.  Undergoes  obligatory  movement.  The  constituent  will  be  a  properly 
governed  argument  assigned  a  theta-role  but  no  case;  correspondingly, 
it  must  contain  a  properly  governed  caseless  position  that  it  selects 
and  assigns  a  theta-role. 

4.  Seleciional  properties  are  correlated  with  agreement  features.  Each 
constituent  will  contain  an  element  that  selects  the  constituent  rep¬ 
resenting  the  next  literal.  Local  affixation  rules  will  morphologically 
merge  the  head  of  the  construction  with  this  element,  thereby  corre¬ 
lating  selectional  properties  with  agreement  features  in  the  lexicon. 
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Figure  2:  On  the  input  3SAT  instance  F  —  (a  V  b  V  c),  (a  V  6  V  c),  a  D-structure 
is  created  to  represent  F,  where  each  literal  is  a  constituent  that  selects  the  literal 
immediately  to  its  right. 
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Figure  3:  Then  the  sixth  constituent  (c2)  moves  to  the  specifier  position  of  the 
third  constituent  (c2),  leaving  behind  a  trace  t{.  This  transformation  relates  the 
r2  literal  of  the  second  clause  in  the  3-CNF  formula  to  the  cl  literal  of  the  first 
clause.  (This  example  assumes  constituents  transparent  to  movement.)  Now  both 
constituents  agree,  by  specifier-head  agreement;  therefore,  the  corresponding  literals 
of  the  formula  variable  c  will  be  assigned  the  same  truth  value,  even  though  they 
appear  in  different  clauses. 


Figure  4:  The  end  result  is  an  S-structure  wnere  specifier-head  agreement  ensures 
that  all  instances  of  a  variable  receive  the  same  truth  value. 
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The  reduction  succeeds  for  any  transformational  theory  that  allows  such  a 
(potentially  intricate)  constituent.  This  provides  additional  evidence  to  sup¬ 
port  the  speculation  of  Ristad  (l!)8b)  that  all  linguistic  models  accounting 
for  syntactic  agreement  and  lexical  ambiguity,  as  all  descriptively  adequate 
models  must,  will  give  rise  to  intractability. 


—  . 
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6  Complexity  of  Move-o  with  Trace-Deletion 

A  central  question  for  current  transformational  theories  of  syntax,  such 
as  the  trace- deletion  approaches  of  Lasiuk  and  Saito  (15)84)  and  Chomsky 
( 1986),  is  what  are  the  consequences  of  interacting  agreement  relations,  such 
as  specifier-head  agreement,  head-head  agreement,  head-projection  agree¬ 
ment,  and  the  various  forms  of  chain  agreement  (link,  extension,  composi¬ 
tion)?  This  question  is  especially  important  because  although  these  forms  of 
agreement  are  explicated  in  distinct  definitions,  they  are  all  performed  with 
the  same  mechanism  of  coindexing.  Thus,  these  models  claim  that  there  is 
only  one  form  of  agreement,  disjunctively  determined  by  the  interaction  of 
complex  rules  and  principles. 

In  the  Barriers  model  of  Chomsky  (1986b  blocking  categories  (BCsi  stop 
unbounded  application  of  move-a.  Informally,  a  BC  is  a  category  not  i>- 
marked  by  a  lexical  .V°.  For  example,  matrix  verb  phrases  are  BCs  because 
they  are  selected  hv  the  nonlexicai  category  1°  (inflection)  without  being 
assigned  a  tf-role.  Unbounded  A-movement  becomes  possible  when  a  cate¬ 
gory  is  moved  local  steps,  adjoining  to  intermediate  nonargument  positions 
before  moving  on  (adjuncrion  is  typically  to  BCs).  As  in  GPSG.  this  notion 
of  strict  phrase  structure  locality  does  not  suffice  to  prevent  computational 
intractability  unless  local  agreement  (specifier-head,  head-headi  can  be  de¬ 
coupled  from  nonlocal  agreement  (chains),  which  may  be  impossible. 

Theorem  4  The  LRP  is  XP-hard  in  the  Barriers  model. 

Proof.  The  idea  of  the  proof  is  from  theorem  3  above;  although  the  idea 
is  simple,  the  actual  details  are  quite  complex.  On  input  3- CNF  formula  F. 
the  reduction  wtU  create  a  lexicon  I.  and  a  0 -structure  that  represents  F. 
then  apply  move-a.  and  finally  lr.’.ert  'underspecified  lexical  forms  to  derive 
an  incomplete  S-structure  that  can  be  completed  according  to  the  lexicon 
L  if  and  only  if  F  is  satisfiable.  The  crux  of  the  reduction  is  to  represent 
each  literal  wuth  the  noun  complement  structure  .V  /  VP  in  I  1)  (for 
example,  desire  to  visit  places’ ): 

[sp,  ---(.vp,  -V  ipx  e  /  \Pt  V  VP, .  ,  jj  (I) 

The  Barriers  model  endows  this  construction  with  the  required  characteris- 
t  ics: 


2;} 


1.  Transpart  nt  to  extraction.  X  P , .  j  ran  be  moved  out  of  NP,  in  the 
structure  (1).  VP  is  a  BC  and  barrier  for  XP,.i  because  it  is  not 
L-marked,  but  NP,_i  can  adjoin  to  the  nonargument  \’P  and  void  its 
barrierhood  because  nonarguments  may  be  freely  adjoined  to.  Both 
NP,  and  IP,  are  L -marked,  and  therefore  are  neither  RC's  nor  barriers 
for  further  NP,.!  raising.  Thus,  NP,.!  can  be  A -moved  to  any  c- 
coimnanding  specifier-of-IP  position  e  without  violating  the  EC'P  be¬ 
cause  all  traces  are  properly  governed  (both  0- governed  by  the  verb  V 
that  selects  X  P , .  ( ,  and  7  -marked  ( antecedent -governed )  by  the  deleted 
trace  adjoined  to  VP). 

Reinhart  (  personal  communication)  suggests  a  similar,  albeit  marginal, 
natural  example  where  an  XP  containing  an  argument  trace  is  topi- 
calired  to  CP  specifier  from  an  L-marked  position:3 

(7)  ?  What  burning  did  John  say  of  what  book  t,  would 

be  magnificent 

2.  Contains  a  landing  site  that  agrees  with  the  constituent  head.  The  in¬ 
ternal  IP,  contains  a  specifier  position  (landing  site)  that  will  agree 
with  I  by  specifier  head  agreement  in  nonlexical  categories;  the  spec¬ 
ifier  position  will  also  agree  with  N  (the  constituent  head),  by  predi 
cation.  Alternately,  head  movement  from  V  to  I  to  X  can  create  an 
inflected  noun  ,y  /  C  /  .V  in  the  A-0  position  of  VP,  that  will  agree 
with  the  landing  site.  Although  I  cannot  find  a  natural  example  of 
such  an  inflected  noun,  no  arguments  or  analyses  exclude  it  in  princi¬ 
ple.  A  close  natural  example  is  noun  incorporation  in  Mohawk  verbs 
( Baker  1985:139'. 

3.  I  ndergoes  obligatory  more  me  nt.  V  assigns  a  0-role  but  no  case  to 
the  X P ,  +  1  position,  requiring  XP..!  to  move.  This  is  possible  if  V 
has  lost  its  ability-  to  assign  case  (passive  morphology)  or  if  XP,_i  is 
the  underlying  subject  of  VP.,  as  in  many  current  accounts  (Sportiche 
198fi.  Fukui  i‘18b.  Larson  19X7.  et  cetera). 

’Chomsky  (pci  suggests  that  the  correct  analysis  of  (71  is 
(6)  what  burning  ,  did  J'.hn  sav  t<  of  what  hook  would  hr  magnificent 


and  that  a  brttrr  topualization  example  might  hr  What  burning  did  John  sag  (that  >/ 
that  boi'k.  \fary  thought  would  be  magnificent 


L.  t 
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4.  SeUctional  prope rtirs  art  correlated  with  p-feature  s.  V  undergoes  oblig¬ 
atory  head  movement  to  rhe  affix  1.  creating  an  inflected  verb  in  the 
head  of  IP.  As  noted  above,  rhe  ^-features  will  appear  on  the  inflected 
verb  by  specifier-head  agreement,  where  thev  may  be  systematically 
correlated  with  the  verb's  selectional  properties  in  the  lexicon. 


Details.  The  input  to  the  reduction  is  a  3  CNF  formula 

/  -  !  tin  Ui  J7/1  J  ).  (  U21  U22  u.3  )' ■  .  .  •  l  Un;  Un3  | 

with  n  clauses,  '.in  literals,  anti  variables  q±.  ..qm.  As  before,  the  reduction 
will  create  a  D-structure  that  represents  F.  and  then  applv  move-o  and 
lexical  insertion  to  derive  the  S-structure  5.  The  reduction  output  will 
consist  of  an  S-structure  V  containing  underspecified  words  and  a  lexicon  L. 
such  that  the  words  in  5  can  be  resolved  according  to  L  if  and  onlv  if  F  i> 
satisfiable. 

The  reduction  algorithm  consists  of  five  steps: 


1.  Create  the  two  p -features  VBL  a nd  TRUE 

The  rn-ary  VBL  feature  identifies  the  formula  variables:  th*  formula 
variable  <7*  is  associated  with  the  feature  [VBL  k] .  The  final  binary 
^'-feature  TRUE  denotes  the  truth  assignment  to  the  given  variable. 

2.  Create  the  D-structure  representation  of  f . 

For  each  literal  u,j  in  the  /*“  clause,  create  the  D-structure  noun  com 
plement  construction: 


(S) 


-Vo 


IP, 


VP. 


Kj  vp, 


J-  1 


where  NPtJ  represents  the  subformula  u,J...ur.i;  NP.  rejiresent- 
the  subformula  .  .  .  10,3;  and  bears  the  r'-features  of  the  vari¬ 

able  qk  that  corresponds  to  the  literal  u,,. 


3.  Apply  head-movement  within  noun  complements 

Move  V,j  to  I,,,  and  then  move  the  inflected  verb  V,.  to  N 


,\p. 


■j  ip , 


/ 
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(Lfl)  [.v/*, .  .1  I  /pi;  t,j  l  \  P ,}  t  \P,j+ 1  | 

As  a  result,  all  A’0  positions  other  than  N°  positions  have  traces  in 
them:  N°  positions  are  tilled  with  inflected  nouns  of  the  form  V  I 
N1.  (The  goal  of  this  operation  is  to  force  agreement  between  the  verb 
and  noun;  this  could  also  be  done  with  predication.) 

4.  Apply  long  distance  .\P-movement  across  noun  complements. 

Starting  with  the  rightmost,  innermost  NP  (NP^)  and  scanning  left 
and  up,  move  N P,;  to  the  specifier  position  e;m  of  the  closest  inflection 
hm  that  agrees  with  it  in  ^-features: 


•  ••  [yPlm  .. 

vi]  \tm ; 

JPlm  -f'~>  ' 

1  -  VPtm  t 

■  ■  [>•/’„  1 

v  -V  >y 

.  1  .  VP 

,  * 

•••  [xp,m  .. 

V  II 

tPlm  .  VPu 

/  1  .  VP lm 

1  [ypt,r..t 

•  ] 

The  leftmost  literal  of  each  formula  variable  is  assigned  case  in  situ 
bv  the  formula  literal  that  selects  it:  all  other  literals  of  the  variable 
receive  a  theta  role  but  no  case,  which  is  what  forces  them  to  move.  AO 
literals  of  a  variable  but  the  rightmost  one  assign  case  to  their  specifier 
position,  so  that  a  literal  will  be  able  to  land  there.  Our  third  ally  in 
making  the  movement  obligatory  is  the  extended  projection  principle, 
which  requires  specifier  of  IP  to  be  filled. 

Movement  is  bv  adjunction  to  intermediate  VPs,  deleting  intermediate 
traces.  This  movement  srU’sfies  the  ECP  because  all  traces  are  prop¬ 
erly  governed  and  therefore  -  -marked  at  S-structure  (both  ^-governed 
by  the  verb  \'tJ  that  selects  them,  and  antecedent-governed  by  the 
deleted  trace  adjoined  to  VP).  The  movement  satisfies  minimrlity  triv- 
iallv  because  the  deleted  trace  is  not  excluded  by  VP.  Because  VPs  are 
only  moved  to  specifier  positions  that  c-command  the;r  bound  traces, 
the  resulting  structure  a'so  satisfies  the  chain  rendition  and  binding 
theory  when  the  chain  is  created.  Note  that  the  movement  is  per¬ 
missible  independent  of  whether  we  express  the  antecedence  relation 
induced  by  mover*  using  indices  or  links  (cf.  Higginbotham,  1983). 

Now  N P u  agrees  with  its  c-commandine  NP;m  on  the  TRUE  feature  by 
specifier-head  agreement  in  IP  and  head-head  agreement  between  I;m 
and  the  head  of  NP,m. 


/ 


'if, 

5.  Perform  (underspecified)  lexical  insertion. 

Insert  the  morphological  form  7  /;.  for  tne  inflected  noun  V  1  N. 
when  ii^j  -  (that  is,  ,xu  is  not  a  negated  variable);  when  utJ  .  </£, 
insert  the  morphological  form  nrt  T/*  instead. 

(13)  v/  -W  vp.j  t  VPim  t  svtm.x  .  ; 

(14)  .  .  .  [ yplm  OiJn'rk.  !Pim  t  .  UP..,,  t  ,\P:,n  .  ,  ] 

Note  that  each  is  a  listed  word;  that  -y  and  7  .ire  sutfixes;  and 

that  /*.  is  the  morphological  realization  of  the  /-feature  [VBL  A-]. 

Further  note  that  the  surface  structure  is  a  string  of  inflected  words. 
For  example,  on  input  /  =  <7iV2</s<72q7<7'}  the  reduction  algorithm  would 
yield  the  surface  string  or  linguistic  expression: 

'*2-^2  "*  rl  Ct2^2'l'r2  °1-^1  f/2  h~!  a3J-t7/2 

We  now  turn  to  morphological  and  lexical  details  of  the  reduction,  namely 
how  we  ensure  that  each  clause  has  one  true  literal,  that  literals  and  variables 
may  he  true  or  false,  and  that  negation  changes  truth  values. 

The  lexicon  contains  ambiguous  inflected  nouns  F  [  .V  that  have  un¬ 
dergone  verbal  incorporation.  We  define  the  morphological  primitives  and 
lexicon  to  ensure  that  the  N-complement  structure  NPtl  for  the  first  literal 
in  the  ithclause  u,!  either  (a)  corresponds  to  a  true  literal,  or  (b)  selects  a 
true  literal  (as  represented  by)  NP:2,  or  (c)  selects  a  false  literal  NPl2  that 
will  select  a  true  literal  NP,3.  Similarly,  the  \  complement  structure  NPl2 
for  the  second  literal  in  the  rlhclause  u,2  must  either  (a)  correspond  to  a  true 
literal,  (b)  correspond  to  a  false  literal,  or  (c)  correspond  to  a  false  literal 
and  select  a  true  literal  NP,3.  Finally,  the  structure  NPl3  for  u,3,  the  third 
and  final  Literal  in.  the  dhclause,  may  correspond  to  a  true  or  false  literal. 

This  lexical  system  requires  one  /-feature  TRUE  on  nouns  to  encode  variable 
truth  assignments;  one  non-/-feature  LITERAL  to  encode  literal  truth  values 
on  both  nouns  and  verbs:  and  one  non-  /  feature  T  on  nouns  that  verbs  can 
select  for.  Recall  that  we  are  assuming  that  the  m  ary  /-feature  VBL  will 
he  morphologically  realized  on  the  nouns  in  the  lexicon,  and  will  distinguish 
the  formula  variables:  \,7  and  N,-.,,  bear  the  same  VBL  value  if  and  onlv  if 
u,j  and  u/.n  are  literals  of  the  same  variable. 
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The  morphological  system  is  constructeii  a>  Miming  the  relativized  head  def¬ 
inition  of  DiScuillo  and  Williams  (  19^7 1.  There  are  six  ambiguous  morpho¬ 
logical  primitives:  the  prefixes  oj.a;.o(  and  roots  Jj .  f>.  d-,.  Together  they 
define  three  ambiguous  listed  words  ni  fi,  ajJj,  and  n(di,  which  mav  each 
combine  with  either  of  the  two  nominal  suffixes  7.7. 


Listed  Word 

Prefix  (a,)  features 

Root  (J, )  features 

N  [LITERAL  l] 

V  [LITERAL  1] 

V [LITERAL  0, Select  T] 

N  [LITERAL  0] 

Q2J2 

N [LITERAL  l.T] 

V  [LITERAL  1] 

V  [LITERAL  0] 

N  [LITERAL  0] 

V [LITERAL  0, Select  T] 

N [LITERAL  0.T] 

c*  3/3? 

_1 

N [LITERAL  l.T] 

V  [LITERAL  1] 

V  [LITERAL  0] 

N  [LITERAL  0] 

The  suffixes  relate  variable  truth  assignments  to  literal  truth  values.  For 
example,  7  will  only  be  lexically  inserted  to  represent  a  negated  variable, 
and  therefore  inversely  relates  the  truth  value  of  the  variable  and  literal. 


Suffix 

Features 

Selects 

7 

N [TRUE  1] 

[LITERAL  1] 

N [TRUE  0] 

[LITERAL  0] 

7 

N [TRUE  1] 

[LITERAL  0] 

N [TRUE  0] 

[LITERAL  1] 

Finally,  there  are  m  listed  inflectional  affixes  corresponding  to  the  m  possible 
values  of  the  VBL  ^-feature.  This  completes  the  presentation  of  the  reduction 
algorithm.Qj 


Comments.  The  proof  exploits  a  Haw  in  the  A-system  of  local  9- relations 
described  in  Barriers  that  arises  because  distinct  linguistic  relations  (head 
chains,  NP-chains.  and  specifier-nead  agreement)  are  conflated  into  the  re¬ 
lation  of  A-chain  coindexing.  Abstracting  from  the  details  of  the  preceding 
reduction,  it  is  very  clear  that  complex,  undesirable  interactions  can  arise 
when  all  agreement  is  per  firmed  by  coindexing,  as  it  i»  in  both  trace-deletion 
models  considered  above.  Linguistically,  this  corresponds  to  conflation  of 
'local1  relations,  such  as  specifier-head  agreement,  and  'nonlocal'  relations, 
such  as  long  A-movement. 

The  lexicon  in  the  reduction  above  strongly  correlates  ^'-features  with  se¬ 
lections!  restrictions.  In  effect,  words  can  indirectly  select  the  selectional 


restrictions  of  their  complements.  In  mv  view,  this  is  perfectly  reasonable: 
it  is  well-known  that  the  properties  of  words  in  natural  lexicons  are  not  sta¬ 
tistically  independent,  and  statistical  correlation  is  perfectly  natural,  even 
expected,  in  anv  iisting  of  idiosyncratic  properties  (that  is.  a  lexicon l.  For 
example,  it  is  possible  to  imagine  a  natural  language  where  the  ^-features 
on  the  subject  are  correlated  with  the  object's  r'-features.  In  brief,  the  syn¬ 
tactic  and  lexical  models  permit  this  kind  of  correlation,  it  may  be  used  in 
complexity  proofs. 

The  most  serious  question  is  whether  the  complexity  of  the  lexical  resolution 
problem  accurately  reflects  on  the  complexity  of  language  perception  as  a 
w'hole.  The  S-structure  built  by  the  reduction  is  very  complex  and  I  have  not 
established  that  it  is  uniquely  determined  by  its  phonological  form.  If  not 
uniquely  determined,  then  hearers  would  be  free  to  perceive  some  simpler 
S-structure  with  the  same  phonological  form  as  the  intractable  S-structure 
built  by  the  reduction.  If  this  were  possible,  then  the  lexical  resolution 
problem  would  no  longer  be  a  realistic  subprobiem  of  linguistic  perception. 

Theorem  5  The  LRP  is  .\P-hard  in  the  Lasmk-Saito  model. 

Proof.  The  preceding  proof  proceeds  without  alteration  in  the  Lasmk-Saito 
model  because  in  that  model,  ^-government  suffices  for  proper  government, 
and  traces  may  be  deleted  at  any  level  after  y-markingj  | 

8.1  Blocking  the  Barriers  reduction  is  difficult 

How  might  we  change  the  Barriers  model  in  order  to  block  the  preceding 
reduction?  The  Barriers  model  appears  to  crucially  reiy  on  the  assumed 
atomicity  of  the  two  nonlexical  categories  C  and  I,  which  together  form  a 
barrier  to  argument  movement.  If  C  assigns  nominative  case  to  specifier-of- 
IP.  as  some  authors  have  proposed  ( Bennis  1980.  Dasgupta  1985),  then  CP 
and  IP  form  an  atomic  unit,  a  barrier  to  antecedent  government.  Although 
this  would  stop  the  exact  construction  used  in  the  reduction,  it  would  also 
leave  the  structure  of  the  infinitival  noun  complement  construction  unex¬ 
plained.  A  second,  theory-internal,  problem  with  this  change  is  that  now 
both  C  and  I  must  govern  the  specifier  of  IP  ((  m  order  to  assign  case,  and 
I  for  specifier-head  agreement),  which  violates  the  minimality  condition. 

The  preceding  proof  relies  on  long  movement  of  the  NP  complement  of  a  verb 
i  in  a  noun  complement  construction  j.  which  is  precisely  what  Barriers  strives 
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to  prevent  by  reducing  proper  government  to  antecedent  government,  using 
the  Lasnik-Saito  7-marking  mechanism.  (The  commitment  to  eliminating 
^-government  from  proper  government  is  tentative  at  best.  The  strongest 
position  taken  is  "'Possibly,  a  verb  does  not  properly  govern  its  0-marked 
complement,"  p.79).  In  the  Barriers  reduction,  an  argument  undergoes  long 
movement  by  adjoining  the  argument  NP  to  YrP,  7- marking  its  trace,  and 
then  deleting  the  intermediate  A-trace  at  LF. 

This  is  the  exact  derivational  sequence  (adjoin,  7-mark,  delete  adjoined 
trace)  used  in  Barriers  (pp. 21-22)  to  move  a  wh-subject  from  a  0-marked 
CP  complement  to  a  specifier  of  CP.  provided  the  wh-  phrase  is  licensed  at 
LF.  Barriers  attempts  to  exclude  similar  long  movement  of  an  NP  from  a 
similar  (but  caseless)  subject  position  by  appeal  to  Binding  Theory  condition 
C  at  S-structure:  the  NP  trace  in  subject  position  would  be  an  A-bound 
R-expression  A-bound  in  the  domain  of  the  head  its  chain  (p.93,  fn.20). 
(Barriers  differentiates  the  two  constructions  solely  by  the  nature  of  their 
traces:  wh-  traces  are  not  R-expressions,  while  NP-traces  are.)  Crucially, 
Chomsky's  argument  holds  only  if  the  trace  deleting  ability  of  affect-alpha 
is  restricted  to  LF,  contrary  to  the  Lasnik-Saito  model:  otherwise,  adjoined 
traces  could  be  deleted  before  causing  an  S-structure  binding  violation. 

But  trace  deletion  cannot  be  restricted  solely  to  LF.  If  it  were,  then  any 
ECP  violation  created  by  LF-movement  may  be  avoided,  simply  by  deleting 
offending  intermediate  traces  after  they  have  done  their  7-marking  duty. 
This  can  be  done  because  adjoined  A-traces  are  not  required  by  the  extended 
projection  principle  at  LF. 

Even  if  this  conundrum  could  be  resolved,  another  one  awaits  us:  the  long 
movement  used  in  the  proof  is  applied  cyclically,  so  that  the  trace  of  the  argu¬ 
ment  NP  is  no  longer  c-commanded  by  the  argument  NP  once  all  movement 
has  applied,  and  hence  not  A-bound  by  the  head  of  its  chain  at  S-structure. 
This  brazen  violation  of  the  c-cuuumand  condition  on  chain  links  is  standard 
in  the  literature,  and  hence  does  not  raise  any  special  problems  here.  Ex¬ 
amples  include  the  topicalization  example  (7)  above,  antecedent-contained 
ellipsis, 

(15)  [Everyone  that  Max  wants  to  e2 ! x  John  will  [kiss  ej[2 
and  passive  VP  topicalization  in  English: 

( 16)  [vp  Arrested  tt  by  the  FBIj^  John,  has  never  been  t:\ 


Furthermore,  even  if  trace  deletion  were  disallowed  entire!'.,  long  movement 
would  still  he  possible  from  ^-marked  noun  complements,  and  the  proof 
would  proceed,  because  0- government  cannot  be  elinunated  without  disas¬ 
trous  consequences  in  the  rest  of  the  theory.  Proper  government  can  be 
reduced  to  antecedent  government  only  if  antecedent  government  suffices 
for  N P- movement  (eg.  passive  and  raising)  in  accordance  with  the  chain 
extension  operation.  This  fails  because  only  the  terminus  of  an  (extended) 
A- chain  may  theta-mark  or  case-mark,  in  order  to  obtain  the  CF.D  effect 
(Condition  on  Extraction  Domains,  see  Barriers,  p.72).  Therefore,  in  pas- 
sive  constructions,  where  the  A  chain  headed  by  the  subject  NP  must  be 
extended  to  include  the  verb  and  inflection  and  thereby  achieve  antecedent 
government  of  the  NP- trace  at  S-structure,  the  inflection  will  simultaneously 
loose  its  ability  to  case-mark  the  subject  position.  The  direct  consequence 
is  that  Barriers  incorrectly  predicts  both  passives  in  (17)  violate  the  case 
filter  and  are  ungrammatical,  whereas  only  ( 17a)  is  ill-formed. 

^ '  a.  *  e;  was  killed  John 
b.  John  was  killed  t 

In  short,  the  chain  extension  required  to  satisfy  the  ECP  without  fheta- 
govemment  will  prevent  the  subject  NP  from  receiving  case,  and  thereby 
violate  the  case  filter.  This  very  serious  problem  may  be  remedied  by  aban¬ 
doning  either  (L)  the  case  filter,  which  would  without  question  be  disastrous 
for  the  theory.  (2)  the  Barriers  analysis  of  CED  effects,  which  would  reduce 
empirical  coverage,  or  (3)  the  coindexing  chain  extension  analysis  of  NP 
movement,  which  will  have  the  direct  consequence  that  proper  government 
cannot  be  reduced  to  antecedent  government. 

Additional  evidence  against  the  coindexing  chain  extension  analysis  of  NP- 
movement  comes  from  gapping  and  VP-deletion.  which  raises  a  contradiction 
for  the  Barriers  chain  extension  account  of  simple  passives:  (I)  each  passive 
conjunct  in  (18)  contains  a  distinct  R-expression  subject  that  therefore  must 
head  distinct  A-chains  with  differing  indices,  hut  (2)  both  extended  A-chains 
share  the  same  verb,  and  therefore  must  share  the  same  index. 


(18) 

a. 

b. 


John,  was  given  t ,  records  and  Sue,  e  books 
John,  was  killed  f,  k  but  Torn,  wasn't  e  ^ 


The  possibility  of  long  argument  movement  bv  adjunction  to  intermediate 
positions  remains  in  Chomsky's  most  recent  theory  of  derivation,  where  he 
proposes  that  derivations  be  subject  to  a  "least  effort  principle,’  with  the  fol¬ 
lowing  provisions.  LF  permits  only  the  following  five  elements:  arguments, 
adjuncts,  lexical  elements,  predicates,  and  operator-variable  constructions. 
Affect-alpha  must  apply  at  LF  to  each  illegitimate  object  to  yield  one  of 
these  five  legitimate  elements.  Chomsky  ( L988b:20)  urges  us  to  “consider 
successive-cyclic  A-bar  movement  from  an  argument  position.  This  will  yield 
a  chain  that  is  not  a  legitimate  object,  and  that  can  become  a  legitimate 
object,  namely  an  operator-variable  construction,  only  by  eliminating  inter¬ 
mediate  A-bar  traces.  We  conclude,  then,  that  these  must  be  deleted  by  LF." 
A  parallel  consequence  of  this  theory  of  derivations  is  that  successive-cyclic 
A-bar  movement  from  a  theta-marked  argument  position  to  a  case-marked 
argument  position  will  also  yield  an  illegitimate  object,  that  can  become 
a  legitimate  object,  namely  an  A-chain.  only  by  eliminating  intermediate 
A-bar  traces  by  LF  (that  is,  before  LF). 
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7  Controlling  complexity  in  systems  of  knowl¬ 
edge 

The  construction  used  in  the  preceding  reductions  may  appear  to  arise  more 
from  inconsistancics  in  the  transformational  model  than  from  human  lan¬ 
guage.  But.  as  [  argued  above,  no  simple  change  to  Barriers  can  block 
the  reduction  while  maintaining  the  theory's  intended  empirical  ‘'overage, 
because  every  property  of  the  reduction  construction  is  independently  jus¬ 
tified  in  the  theory.  More  accurately  then,  the  reduction  construction  is 
simply  a  complex,  unnatural  combination  of  the  properties  of  actual  natural 
constructions. 

In  order  to  explain  complex  phenomenon,  generative  linguists  invariably 
construct  complex  theories  that  are  difficult  to  change  and  computationally 
intractable.  In  my  opinion,  this  problem  is  inherent  in  the  attempt  to  con¬ 
struct  a  system  of  knowledge.  If  you  know  about  long  movement,  agreement, 
and  head  movement,  then  you  also  know  all  possible  combinations  of  those 
linguistic  processes.  Nor  is  a  cost  assigned  to  employing  those  combinations 
or  determining  the  consequences  of  knowledge.  Theories  of  knowledge  are 
inherently  non-modular  and  maximally  interactive— they  lack  techniques 
for  controlling  complexity  —and  this  is  why  it  is  so  difficult  to  construct  a 
complex  system  of  knowledge.  This  is  the  same  critical  flaw  that  doomed 
cybernetics. 

7.1  Taming  the  complexity  of  simple  operations 

Technically,  the  preceding  proofs  have  shown  that  simple  local  linguistic  op¬ 
erations  in  transformational  grammars  (including  local  agreement  and  move- 
q  bounded  by  the  ECF)  can  have  complex  consequences.  How  might  the 
complexity  of  simple  linguistic  operations  be  reduced?  Interactions  among 
operations  must  be  tamed.  As  things  currently  stand,  this  will  be  difficult  to 
do  for  the  transformational  models  considered  above  because  many  concep¬ 
tually  distinct  relations  (forms  of  syntactic  agreement)  are  uniformly  rep¬ 
resented  via  comdexing.  and  this  maximizes  interactions  among  principles. 
Therefore,  the  first  step  is  to  segregate  distinct  relations  in  representation. 
This  approach  has  led  phonologists  from  the  solely  segmental  representa¬ 
tions  of  SPE  to  nonlinear  models  (autosegmental,  prosodic).  In  syntax, 
the  potentially  distinct  anaphora-antecedent,  chain  link,  and  specifier-head 


agreement  relations  would  be  best  represented  by  distinct  relations,  rather 
than  by  the  uniform  co-indexing  mechanism.  Representational  theories  of 
movement  (eg.,  g-projection,  dynasty  formation,  chain  formation  algorithm) 
might  strive  to  decouple  principles  on  the  verge  of  orthogonality,  such  as 
binding  and  government  in  Rizzi's  relativized  minimality  approach. 

The  second  step  is  to  more  clearly  distinguish  linguistic  representations  from 
the  abstract  process  whereby  they  are  specified.  This  necessity  of  this  con¬ 
ceptual  distinction  has  been  clear  since  Descartes  first  argued  for  it  in  in¬ 
troductory  philosophy.  In  generalized  phrase  structure  grammar,  the  local 
tree  representation  is  sharply  distinguished  from  the  process  of  specifying 
the  local  trees  (metarule  finite  closure,  and  ID  rule  projection  according 
to  universal  feature  instantiation,  et  cetera).  The  distinction  is  similarly 
clear  in  representational  approaches  to  syntactic  movement:  chain  link  rep¬ 
resentations  are  specified  by  an  abstract  process  of  iteration  (g-projection, 
dynasty  formation,  or  chain  formation  algorithm)  whose  intermediate  states 
are  distinct  from  its  output.  But  in  transformational  approaches,  the  fun¬ 
damental  distinction  between  representation  and  process  is  blurred,  because 
the  chain  link  representation  is  simply  the  history  of  the  process  of  applying 
the  rnove-a  transformation.  In  such  a  model,  it  is  unclear  if  the  difficulty  of 
constructing  the  Unguistic  representation  is  inherent,  or  merely  an  artifact 
of  an  unnecessarily  complex  specification  process. 

Neither  representational  segregation  nor  a  sharper  distinction  between  pro¬ 
cess  and  representation  entails  a  reduction  in  the  complexity  of  simple  prin¬ 
ciples.  But  both  steps  would  certainly  clarify  the  fundamental  computa¬ 
tional  structure  of  linguistic  theory:  hopefully,  this  clarification  will  lead  to 
an  improved  understanding  of  the  inherent  complexity  of  simple  linguistic 
principles. 
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8  Syntactic  Binding 

Every  child  knows  that  pronouns  stand  for  nouns,  and  that  speech  would 
soon  become  extremely  difficult  without  them.  We  would  soon  grow  tired  of 
repeating  the  nouns  pronouns  stand  for  over  and  over.  Nor  could  we  easily 
introduce  ourselves  to  fellow  travelers  and  proceed  to  discuss  our  respective 
destinations. 

Consider  the  problem  of  determining  pronoun  antecedents.  Two  pronouns 
can  share  an  antecedent  only  if  they  agree  and  they  are  not  too  close.  Thus, 
the  pronoun  her  cannot  stand  for  Bill  in  the  sentence  Bill  saw  her  mother 
because  her  is  not  masculine  gender.  And  the  sentence  Bill  knew  he  liked 
him  cannot  mean  'Bill  knew  Bill  liked  Bill,’  unlike  the  sentence  Bill  knew  he 
liked  himself ,  which  could  mean  that.  In  effect,  pronouns  are  competing  for 
antecedents:  in  some  cases,  two  pronouns  may  share  an  antecedent,  and  in 
the  other  'disjoint  reference'  cases  they  may  not.  Exactly  how  hard  is  this 
task? 

The  interactions  among  pronouns  can  become  impossibly  complex  when 
there  are  many  pronouns  and  only  a  few  possible  antecedents.  In  fact, 
determining  pronoun  antecedents  is  like  coloring  the  nodes  of  a  graph,  where 
possible  antecedents  are  colors  and  pronouns  are  the  nodes  of  a  graph  whose 
edges  represent  disjoint  reference  conditions.  This  correspondence  proves 
that  the  pronoun  antecedent  problem,  like  the  NT-complete  graph  coloring 
problem,  cannot  be  solved  in  practice  by  any  known  method.  How  then  can 
we  understand  sentences  containing  pronouns? 


8,1  Binding  at  S-structure 

Binding  theory  is  concerned  with  the  permissible  linguistic  antecedents  of 
anaphora,  such  as  pronouns,  reflexives,  and  reciprocals.  Binding  theory 
states  conditions  on  binding  relations  between  A-positions  (positions  as¬ 
signed  a  grammatical  function,  such  as  SI  B  JECT  or  OBJECT )  at  S-structure 
and  LF.  Chomsky  (1981:188)  formulates  the  binding  theory  as  follows: 

(!9)  4  .  .  ,  .  .  . 

a.  An  anaphor  is  bound  in  its  governing  category. 

b.  A  pronominal  is  free  in  its  governing  category. 

c.  An  R-expression  is  free  everywhere. 
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( Arguments  arc  bound  bv  c  commanding  antecedents,  :!>e  free;  a  governing 
category  for  a  is  the  minimal  XF  or  S  containing  «  and  a  governor  of  a. )  T  wo 
additional  binding  conditions  are  ( 1 )  anaphora  must  find  their  antecedents  in 
some  level  of  linguistic  representation,  such  as  S-structure,  LF,  or  discourse 
structure,  and  (2)  anaphora  that  share  an  antecedent  must  agree  with  each 
other  and  with  that  antecedent  on  agreement  features.4  Occasionally,  a 
third  condition,  the  i-withm-i  condition ,  is  cited  to  prohibit  categories  from 
being  antecedents  of  anaphora  they  dominate. 

A  syntactic  binding  is  a  set  of  anaphor-antecedent  relations  that  satisfies  the 
binding  theory,  such  that  all  anaphora  have  S-structure  antecedents.  The 
binding  problem  is; 

Binding  Problem  (BINDING) 

Given  a  syntactic  representation  S  lacking  only  binding 

relations,  construct  a  permissible  syntactic  binding  for  5, 

according  to  binding  theory. 

The  corresponding  decision  problem  is  to  decide  if  a  given  syntactic  repre¬ 
sentation  has  a  permissible  syntactic  binding.  The  all  bindings  problem  is 
to  enumerate  all  permissible  syntactic  bindings  for  a  given  syntactic  repre¬ 
sentation. 

Theorem  0  The  binding  decision  problem  is  NP-hard. 

Proof.  The  proof  will  be  by  a  trivial  reduction  from  GRAPH  F  COhORABILITY 
to  BINDING.  On  input  graph  G  =  (V,E)  with  vertices  i'x...vn,  we  need 
n  binary  agreement  features,  n  pronouns,  and  k  R-expressions.  Pronoun  p, 
represents  vertex  i>, :  foi  each  edge  (in.v,)  from  r,.  pronoun  p,  has  pi  —  0 
and  pronoun  p,  has  <p,  -  1.  ui  the  S-structure.  the  it h  R-expression  fl, 
c-commands  the  t  +-  1th  R-expression  R, 4-j,  and  each  pronoun  is  free  in  its 
governing  category,  where  it  is  c-commanded  by  all  R  expressions,  which  are 
its  potential  antecedents. 

'The  transitivity  of  agreement  is  trivially  demonstrated  by  the  ungrammaticality  of 
Every  student,  prepared  her,  plan  and  did  his,  homework ,  where  each  pronoun  agrees  with 
its  operator  antecedent,  but  the  two  pronouns  disagree  with  each  other,  and  though  neither 
pronoun  is  the  others  antecedent,  the  fact  that  they  disagree  through  their  common 
antecedent  uon*thel*ss  excludes  the  example 


/ 

M, 


Each  pronoun  must  ultimately  have  <ui  K  expression  antecedent  <r<>  satisfy 
the  binding  theory),  which  will  c-command  it  (by  construction  l.  All  R 
expressions  have  disjoint  reference  by  binding  condition  (\  and  therefore 
there  are  exactly  k  distinct  possible  antecedents.  If  there  is  an  edge  between 
two  vertices  in  the  graph,  then  the  two  c  "-esponding  pronouns  cannot  share 
an  antecedent  without  disagreeing  on  some  agreement  feature;  there  are  only 
fc  possible  antecedents  (the  R  expressions ),  and  therefore  each  permissible 
binding  for  the  S-structure  corresponds  »o  a  I-coloring  of  the  graph  G h  ; 

Comments.  Crucially,  the  reduction  proves  that  it  is  the  combination  of 
disagreement,  binding  conditions  B  and  C,  and  prohibitions  against  circular 
dependencies  that  creates  inherently  complex  disjoint  reference  graphs.  (The 
disjoint  reference  graph  in  the  preceding  reduction  was  created  entirely  via 
feature  disagreement  simply  because  that  was  the  simplest  reduction.) 

Although  the  reduction's  extensive  use  of  agreement  features  does  not  in¬ 
crease  our  understanding  of  natural  language,  it  does  offer  greater  insight 
into  linguistic  theory.  It  should  be  clear  that  agreement  features,  a  much 
neglected  part  of  linguistic  theory,  are  surprisingly  powerful.  A  reduction 
based  on  the  other  disjoint  reference  conditions  (binding  conditions  B  and 
C.  and  i-within-i),  with  perhaps  a  constant  number  of  agreement  features, 
might  offer  better  insights  into  binding  phenomenon. 

The  other  weakness  is  that  the  reduction  requires  an  unnaturally  arbitrary 
specification  of  agreement  features  on  pronouns  (see  below).  Functionally, 


pronouns  and  anaphors  serve  to  -itnplifv  linguist ir  expressions,  and  t heretore 
one  would  expect,  contrary  to  the  reduction  construction,  that  pronouns  and 
anaphors  specified  for  different  agreement  features  would  also  have  different 
antecedents.  But  this  weakness  is  ultimately  attributable  to  a  failure  of 
current  linguistic  theory  to  view  anaphora  front  this  perspective. 

The  great  strength  of  this  reduction  is  that  it  is  based  directly  on  a  natu¬ 
ral  syntactic  construction,  whose  pronouns  must  find  antecedents  in  order 
for  the  construction  to  he  understood.  Therefore,  the  binding  problem  is 
a  necessary  subproblem  of  language  comprehension.  It  also  raises  a  sig¬ 
nificant  computational  question  for  the  binding  theorv.  namely,  is  binding 
really  so  much  like  graph  coloring,  a  problem  which  is  hyper  -exponentially 
ambiguous  and  cannot  be  efficiently  performed  bv  any  known  method,  even 
approximately  (Lawler.  1976!.’ 

8.2  Limiting  disagreement  with  the  transparency  condition 

The  most  unnatural  characteristic  of  the  binding  construction  used  in  the 
preceding  reduction  from  eraph  coloring  is  that  two  pronouns  with  entirely 
different  features  could  nonetheless  sfovre  an  antecedent,  provided  that  they 
did  not  disagree  on  any  common  feature.  That  is.  two  pronouns  could  share 
an  antecedent  if  their  agreement  features  could  he  unified;  fhey  need  not 
agree  exactly,  but  they  must  not  have  different  values  for  the  «ame  feature. 
In  natural  language,  if  seems  that  an  argument  may  bind  anaphora  that  are 
virtually  identical,  modulo  case  and  affix  (for  example,  the  anaphoric  suffix 
-self).  Let  us  say  that  two  anaphora  are  transparently  identical  if  they  have 
the  same  root  morpheme  and  a^rne  exactly  on  agreement  features,  although 
they  may  have  different  cases  or  affixes.  This  relation  partitions  anaphora 
into  equivalence  classes,  which  we  may  cal!  transparency  classes  (t-classes). 
For  example,  he.  him.  and  himsf  if  are  transparently  identical,  and  form  one 
t-class  in  the  English  lexicon. 

In  Light  of  these  observations.  I  propose  an  additional  binding  condition, 
which  limits  interactions  among  r  identical  anaphora 

Transparency  condition 

All  anaphora  with  the  same  antecedent  must  be  t  identical. 

The  transparency  condition  exph-.in-  -vhv.  in  some  dia'-  t f  s  of  English,  a 
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masculine  singular  antecedent  can  independent^  hind  t h»*  pronouns  ; ami 
their,  but  cannot  simultaneously  bind  both  pronoun'  in  the  same  stru<  ture: 

1  a.  Every  boy,  did  his,  homework. 

b.  Every  boy,  did  their,  homework. 

c.  *  Every  boy,  did  his,  homework  and  prepared  their,  lesson  plan. 


a.  Sue  saw  the  dog,  when  it,  barked. 

b.  Sue  saw  the  dog,  after  he,  was  kicked. 

c.  *  Sue  saw  the  dog,  when  it.  barked  after  he,  was  ki>  ked. 

(Note  how  poor  the  second  example  is  with  the  pronouns  ht  and  it  inter¬ 
changed.)  The  transparency  condition  can  also  explain  why  disjoint  refer¬ 
ence  conditions  only  appear  to  hold  within  a  t-class: 

(22)  Bill,  reminded  Sue,  that  gc,thtir)  he.  paid  for  riieir  ,  .  tickets 

A  compound  noun  may  be  the  antecedent  of  two  t  classes,  in  apparent  vi¬ 
olation  of  the  transparency  condition.  ( Of  course,  it  is  always  possible  to 
make  up  a  story  where  the  compound  noun  posses  two  indices,  as  in  baby 
broth ,  rj  ,. ) 

(23/  Susie's  baby  brother  ,  cried  when  she  left  it,,  nut  soon  thereafter 
he,  went  to  sleep. 

t  nfortunately,  although  this  transparency  condition  successfully  prevents 
the  particular  details  of  the  preceding  i eduction,  it  does  not  in  general  suf¬ 
fice  to  limit  binding  interactions.  Now  t-classes.  rather  than  individual  pro¬ 
nouns,  compete  for  antecedents  in  the  presense  of  disagreement  conditions. 
In  the  next  section  I  will  present  a  pair  of  binding  constructions  that  satisfy 
the  transparency  condition,  and  then  use  rhose  constructions  to  simulate 
Q1  AN’TIHED  3SAT.  Moreover,  the  disjoint  ref  erence  graph  arising  from 
binding  conditions  R  and  C  remains  unaffected 
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9  Binding  and  Copying  in  Syntax 

Language  users  inwardly  relate  a  complete  meaning  to  every  sound  uttered 
as  language:  that  is,  for  a  language  user,  every  sound  is  a  complete  utterance. 
This  gives  language  tremendous  expressive  power,  especially  when  taken  in 
combination  with  language’s  intricate  underlying  structure. 

For  example,  the  sentence  the  men  ate  but  the  women  didn't  is  understood 
to  mean  ‘the  men  ate  but  the  women  didn't  eat.'  In  so-called  VP-deletion 
constructions,  the  second  verb  phrase  is  an  invisible  copy  of  the  first  verb 
phrase:  understood,  but  not  spoken.  If  the  verb  phrase  contains  a  pronoun, 
that  pronoun  may  be  understood  as  though  it  were  also  copied,  because 
the  pronoun  and  its  copy  may  have  different  antecedents.  For  example,  the 
sentence  the  men  ate  their  dinner  but  the  women  didn't  can  mean  ‘the  men 
ate  the  men’s  dinner  but  the  women  didn’t  eat  the  women’s  dinner.’ 

If  the  twin  subjects  of  a  VP-deletion  construction  disagreed  on  some  feature 
(gender  in  the  preceding  examples)  and  the  pronoun  in  the  deleted  verb 
phrase  represented  a  variable  in  a  Boolean  formula,  then  such  a  so-called 
sloppy  identity  construction  would  represent  a  universal  quantifier  over  that 
feature:  the  construction  would  be  permissible  only  if  the  pronoun  could 
could  be  independently  bound  by  both  subjects.  This  is  possible  only  if  the 
pronoun  can  agree  with  both  (contrary)  feature  settings. 

A  slightly  different  binding  construction  mimics  an  existential  quantifier.  In 
the  sentence  the  men  knew  the  women  ate  their  dinner,  either  subject  may 
bind  the  pronoun  their.  Continuing  with  the  idea  of  pronouns  as  variables 
and  antecedents  as  truth  values,  this  binding  construction  represents  a  exis¬ 
tential  quantifier  over  some  feature,  where  the  pronoun  can  be  freely  bound 
by  either  of  the  two  antecedents  that  disagree  on  that  feature. 

A  third  binding  construction  can  represent  3SAT  instances.  The  sentence 
he  introduced  him  to  him  is  necessarily  about  three  people:  no  two  of  the 
three  pronouns  can  share  an  antecedent.  Similarly,  at  least  one  of  the  three 
pronouns  in  the  sentence  John  thought  Peter  said  he  introduced  him  to  him 
must  refer  to  a  third,  unnamed  made  person.  This  construction  can  be  used 
to  mimic  a  3- CNF  clause,  where  at  least  one  of  the  pronouns  must  be  bound 
outside  the  construction  to  an  antecedent  representing  a  quantified  variable. 
This  pronoun  represents  a  true  literal  in  the  3-CNF  clause. 

Finally,  if  the  VP-deletion  and  binding  constructions  are  put  together  prop- 
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erlv.  they  would  represent  quantified  Boolean  formulas.  Therefore,  deter¬ 
mining  the  meaning  of  such  Linguistic  utterances  can  he  as  difficult  as  playing 
hoard  games,  such  as  Checkers,  Chess,  and  Go. 


9.1  Linking  and  Copying  at  LF 

Every  linguistic  account  of  sloppy  identity  in  VP-deletion  construct  ions  re¬ 
quires  that  the  overt  VP  in  effect  be  copied  to  the  empty  VP  position,  and 
that  pronominal  binding  occur  both  before  and  after  that  copying  is  per¬ 
formed.  Some  representative  accounts  include:  copying  or  'reconstruction' 
of  the  VP  directly  at  S-structure  (Koster,  1987;  Ross.  1967);  copving  in 
the  S-structure  to  LF  mapping  (May,  1985);  copying  and  interpretation  in 
syntax  ( Wasow,  1972);  or  in  two  stages — copying  an  empty  VP  structure  at 
S-structure  and  interpreting  the  antecedent  VP  at  LF.  in  Sentence  Gram¬ 
mar.  and  then  copying  the  interpreted  antecedent  VP  after  LF.  in  Discourse 
Grammar  (Williams,  1977). 5  Although  these  theories  differ  in  the  exact  for¬ 
mulation  of  the  copying  rule,  at  what  level  of  representation  the  rule  applies, 
and  the  ride’s  name,  real  differences  are  extremely  minor,  and  do  not  affect 
the  subsequent  discussion  or  the  proof  at  all. 

Without  loss  of  generality,  I  will  restrict  my  attention  to  binding  and  copying 
at  LF  in  the  T-tnodel  (for  example,  May  1985).  In  the  T-model  anaphora- 
antecedent  relations  may  be  determined  both  at  S-structure  and  LF.  while 

'Williams  (1977)  proposes  that  Sentence  Grammar  contains  an  optional  Pronoun  Rule 
that  replaces  pronouns  inside  a  VP  with  variables  bound  by  lambda  operators  whose  scope 
is  the  entire  VP  For  example,  John  s aw  hu  mother  is  assigned  the  LF  ;John  ;Xz  (z  saw 
z  's  mother  )Jj  when  the  Pronoun  Rule  applies,  and  John  Xz  (z  saw  hu  mother:  I’  when 
it  does  not.  (Note  that  Williams  claims  both  LF  representations  have  the  same  meaning, 
but  his  failure  to  explicitly  relate  the  pronoun  his  to  its  antecedent  in  the  second  LF 
representation  is  incoherent.)  Discourse  Grammar  strictly  follows  Sentence  Grammar,  and 
contains  a  VP  Rule  that  copies  entire  VP-lambda  expressions.  Sloppy  identity  readings 
occur  when  VP  copying  applies  to  the  output  of  the  Pronoun  Rule;  alternate  readings 
occur  when  the  optional  Pronoun  Rule  does  not  apply.  Therefore,  Williams'  account 
would  seem  to  preclude  free  pronoun  binding  after  VP  copying.  But,  by  Williams’  own 
arguments,  the  Pronoun  Rule  must  belong  to  Discourse  Grammar  because  it  operates 
across  sentence  boundaries: 

*  a.  Frank  loves  Sue. 

b.  No,  he  only  thinks  he  loves  her. 


Therefore,  Williams  framework  must  also  allow  pronoun  binding  to  occur  both  before 
and  after  VP  copying,  contrary  to  his  claims. 


u 


constituent  copying,  as  required  for  VP -deletion  constructions,  is  performed 
in  the  S-structure  to  LF  mapping.  For  example.  (25)  can  mean  either  that 
the  women  ate  the  women's  dinner,  or  that  the  women  ate  the  men's  dinner 
(see  26). 

(25)  The  men  didn’t  eat  their  dinner  hut  the  women  did 


(26) 

a. 

b. 


The  men,  didn't  eat  their,  dinner  but  the  womenj  ate  their, 
dinner 

The  men,  didn't  eat  their,  dinner  but  the  womenj  ate  their, 
dinner 


In  example  (26a),  binding  is  performed  before  copying  while  in  (26b).  bind¬ 
ing  is  performed  after  copying.  Conversely,  example  (27)  does  not  have  a 
reading  where  some  women  eat  some  man's  dinner. 

(27)  Which  men  didn't  eat  their  dinner  and  which  women  did 

This  is  explained  by  the  binding  theory,  which  states  that  pronouns  may 
only  link  to  A-positions,  and  the  LF  binding  conditions,  which  require  each 
operator  to  c-command  its  variable  and  all  anaphora  linked  to  that  variable. 
Therefore,  in  this  VP-deletion  construction,  anaphora  can  only  be  bound  by 
the  subject  of  the  VP  that  contains  them  at  LF. 

I  now  make  crucial  use  of  sloppy  identity  in  VP-deletion  constructions  to 
prove  that  the  S-structure*to  LF  mapping  is  extremely  difficult.  Informally, 
if  we  embed  some  VP-deletion  constructions,  and  hide  a  parameterized  com¬ 
putation  in  the  innermost  VP,  we  might  then  have  to  perform  that  compu¬ 
tation  differently  an  exponential  number  of  times. 

The  reduction  is  dearest  when  presented  assuming  the  linking  theory  of  Hig¬ 
ginbotham  (1983),  which  replaces  the  indices  of  the  standard  binding  theory 
with  directed  edges  (links)  from  anaphor  to  antecedent,  and  the  stipulatorv 
i-within-i  condition  with  a  more  natural  (semantic)  prohibition  against  cir¬ 
cular  dependencies.  (An  element  depends  on  all  elements  it  dominates,  and 
on  all  elements  its  antecedents  depend  on.) 


Theorem  7  The  S-structure  to  LF  mapping  is  PSPACE-hard. 


Proof.  By  reduction  from  QUANTIFIED  3SAT.  The  input  f!  is  a  quanti¬ 
fied  Boolean  formula  in  prenex  3-CNF,  consisting  of  alternating  quantifiers 
ill  zx2  ■  ■  ■  Vfn_irxn  preceding  (and  quantifying  the  literals  in)  the  clauses 

rVC'2 . C'p  in  the  Boolean  variables  *1, £2, . . .  Each  clause  contains 

exactly  three  distinct  literals  labeled  by  C,  =  (a,  V  6,  Vc, ).  The  output  is  an 
S-structure  S  such  that  5  has  a  permissible  LF  representation  iff  fi  is  true. 

Each  quantifier  (Vr,  or  3r,)  is  represented  by  a  S-structure  construction 
Q  c„  where  antecedents  in  A- positions  represent  the  quantified  variables. 
Each  Boolean  clause  C,  is  represented  by  an  S-structure  construction  CCj. 
where  pronouns  represent  formula  literals,  and  at  least  one  pronoun  in  each 
CCj  must  link  to  some  c-commanding  A-position  in  one  of  the  dominating 
QC,  constructions.  Thus,  the  entire  QUANTIFIED  3SAT  instance  is  rep¬ 
resented  by  an  S-structure  composed  of  QC  and  CC  constructions:  each 
QC,  immediately  contains  QCtTl;  QCn  immediately  contains  CCi;  and 
each  CCj  immediately  contains  CCJ+t.  as  shown  in  figure  5. 

A  universal  quantifier  is  represented  by  a  VP-deletion  construction.  In 
the  VP-deletion  construction  QC,,  the  disagreeing  operator- variable  chains 
(Op,,  lei,)  and  (Op,,  feji )  represent  the  quantifier  determined  truth  assign¬ 
ments  x,  =  1  and  Xi  =  0,  respectively.  S-structure  linking  conditions  require 
pronouns  to  only  link  to  A-positions  (in  this  case,  to  the  syntactic  variables 
in  subject  position).  Feature  agreement  on  the  ID  feature  requires  pronouns 
representing  literals  of  xx  to  link  to  either  ej,  or  [ejt,  and  never  to  another 
A-position  in  another  QC  construction.  Recall  that  linking  conditions  at  LF 
require  each  operator  to  c-command  its  syntactic  variable  and  all  anaphora 
linked  to  that  syntactic  variable.  Therefore,  because  each  variable  in  subject 
position  is  locally  bound  by  an  operator,  pronouns  contained  in  the  original 
VP  must  link  to  the  syntactic  variable  [ej,  linked  to  Op,;  these  pronouns 
will  correspond  to  unnegated  literals,  which  are  true  iff  xt  is  true.  Similarly, 
pronouns  in  the  copied  VP  must  link  to  the  syntactic  variable  [e]t  linked  to 
Op,;  these  pronouns  will  correspond  to  negated  literals,  which  are  true  iff 
r,  is  false. 

An  existential  quantifier  is  represented  by  a  linking  ambiguity.  In  the  linking 
ambiguity  construction  QC,+i,  the  R-expressions  NP.^i  and  NP,+1  repre¬ 
sent  the  two  possible  truth  assignments  r,^i  =  1  and  r,+1  =  0,  respectively. 
Structural  conditions  (c-command)  and  feature  agreement  on  the  ID  feature 
force  the  pronoun  p'j.,  to  link  to  either  NP,^  or  NP,rl.  The  pronoun  p'rl 
bears  an  additional  feature,  indicated  by  that  disagrees  with  a  feature 


Figure  5:  The  S-structure  in  the  figure  represents  the  QUANTIFIED  3SAT  in¬ 
stance  Vxt  3*2  ...  Vxn_i3*nCi ,  C2,  .. C'p .  Each  universal  quantifier  Vxt  is  repre¬ 
sented  by  a  VP-deletion  construction.  At  LF,  each  of  the  n/2  circled  overt  VPs  is 
copied  to  its  empty  VP  position  [vp  ej.  The  resulting  LF  is  exponentially  larger 
than  t  j.t  S-structure  shown.  Each  existential  quantifier  3*,>i  is  represented  by  a 
linking  ambiguity  construction.  Both  quantifier  constructions  contain  permissible 
antecedents  that  represent  possible  truth  assignments  to  their  quantified  variable. 
Each  3-CNF  clause  Cj  is  represented  by  a  CC;  construction  that  contains  a  selected 
pronoun  that  must  link  outside  that  construction,  to  a  permissible  antecedent  in 
some  dominating  QC<  construction.  These  obligatory  longdistance  links  are  drawn 
with  dashed  arrows.  Each  selected  pronoun  represents  a  true  literal  in  its  3-CN'F 
clause. 


Figure  6:  The  VP-deletion  construction  QC,  represents  the  universal  quantifier 
V*j.  The  disagreeing  operator-variable  chains  (Opi,  [e]j)  and  (Op*,  [e]i )  represent 
the  truth  assignments  =  1  and  =  0,  respectively.  LF  linking  conditions  ensure 
that  pronouns  contained  in  the  CC  constructions  may  only  link  to  the  subject  of 
the  VP  that  contains  them  (identified  by  bold  arrows).  Therefore,  only  pronouns 
in  the  original  VP  representing  unnegated  literals  of  x,  may  link  to  [ej,.  Simi¬ 
larly,  only  pronouns  in  the  copied  VP  representing  negated  literals  of  Xi  may  link 
[e]t.  The  circled  VP  leaf  node  will  immediately  dominate  the  QC,ti  construction 
representing 


If) 


Figure  8:  The  S-structure  construction  CC;  represents  the  clause  C}  of  the  3-CNF 
formula.  Only  two  antecedents  (NP;l,  NP^j)  are  available  to  the  three  disjoint  pro¬ 
nouns  aj,bj,Cj  in  the  construction.  Therefore  at  least  one  of  the  three  pronouns 
must  be  linked  to  one  of  the  c-commanding  antecedents  in  some  dominating  QC, 
construction.  The  NPJ  +  i  leaf  will  immediately  dominate  the  next  CCj  +  1  construc¬ 
tion. 


borne  by  all  pronouns  in  the  CC  constructions.  Thus,  feature  disagree¬ 
ment  prevents  both  p*+1  and  its  antecedent  (either  NP,+i  or  NPl+1)  from 
binding  any  pronoun  (representing  a  literal  of  r,+i)  contained  in  any  CC_,. 
Therefore,  those  pronouns  in  CC;  must  be  bound  by  the  NP  subject  that 
p*M  does  not  link  to.  For  example,  if  p*+1  links  to  NP,-i,  then  NP,  +  i 
may  bind  pronouns  in  any  CC;  that  correspond  to  7,+1,  which  is  true  iff 
=  0.  Otherwise,  p*+1  must  link  to  NP,*j,  and  then  NP1  +  1  will  bind 
those  pronouns  that  correspond  to  r,  +  i,  which  is  true  iff  xt  +  i  -  1. 

3-CNF  clauses  are  represented  by  linking  ambiguities.  In  the  S-structure 
construction  CCJ?  each  clause  Cj  of  the  Boolean  formula  is  represented  by 
a  noun  phrase  NP;.  and  each  literal  of  the  Boolean  clause  is  represented  by 


a  pronoun  with  a  truth  value  that  makes  that  literal  tnm  ( Consequent lv. 
pronouns  representing  negated  literals  will  only  agree  with  Op,  and  \P,_i 
in  some  dominating  QC,  construction.) 

Feature  agreement  forces  pronouns  p.y  and  pj2  to  both  link  to  N'P; .  ( .  There¬ 
fore.  no  pronoun  dominated  by  N'P.»i  may  have  NP,,.  NP.>.  p., ,  or  p ,,  as 
an  antecedent  without  creating  an  impermissible  circular  dependence.  The 
three  pronouns  a]xb]xCj  c-command  NPj^i  and  all  it  contains;  therefore, 
they  cannot  be  linked  to  any  A-posirion  below-  them.  Nor  may  they  be  linked 
to  each  other  or  otherwise  share  an  antecedent  without  violating  linking  con¬ 
dition  B.  because  they  are  in  the  same  governing  category.  (Nothing  hinges 
on  being  able  to  stuff  three  pronouns  in  one  governing  category,  because 
disjoint  reference  can  always  be  accomplished  with  feature  disagreement.) 
Moreover,  because  only  two  distinct  antecedents  (NP,!.  NP;2)  are  available 
to  the  three  disjoint  pronouns  in  the  entire  CC^  construction,  at  least  one  of 
the  three  pronouns  must  be  linked  to  —  and  agree  with  —  a  c-commanding 
subject  outside  CC in  some  dominating  QC,  construction.  This  pronoun 
represents  a  true  literal  in  the  Boolean  clause  C2:  it  agrees  w-ith  the  subject 
it  is  linked  to  in  some  QC,,  and  is  called  the  selected  pronoun.  Note  that 
the  CC  constructions  are  contained  inside  n/2  VP-deletion  constructions 
at  S-structure.  and  that  therefore  the  corresponding  LF  representation  will 
contain  2"' 2  copies  of  the  CC;  S-structure,  each  capable  of  selecting  its  own 
pronoun. 

The  selected  pronoun  may  represent  a  literal  of  either  a  universally  or  ex¬ 
istentially  quantified  variable  r,.  If  ix  is  universally  quantified,  then  the 
agreement  condition  ensures  that  only  pronouns  p,  corresponding  to  un¬ 
negated  literals  r,  and  contained  in  the  original  VP  at  LF  can  link  to  the 
subject  [ej,  of  the  original  VP.  Conversely,  only  pronouns  ~pl  corresponding 
to  negated  literals  77  and  contained  in  the  copied  VP  at  LF  can  link  to  the 
subject  [ej,  of  the  copied  VP. 

Otherwise,  r,  is  existentially  quantified,  and  the  agreement  condition  ensures 
that  pronouns  p,  corresponding  to  unnegated  literals  r,  ran  only  link  to  NP,, 
that  pronouns  p7  corresponmng  to  negated  literals  77  can  only  link  to  NP,. 
Recall  that  if  any  pronoun  in  any  CC;  links  to  NP,  (or  NP, ),  then  all 
pronouns  in  all  CC;  must  link  to  that  antecedent. 

The  CC j  construction  is  permissible  iff  all  of  its  LF  copies  are  permissible, 
which  is  only  possible  when  the  Boolean  clause  C,  contains  a  true  literal  for 
any  possible  quantifier-determined  truth  assignment  to  its  literals,  as  rep- 


resented  by  the  dominating  QC  constructions.  Therefore,  the  LF  r^pre-tm- 
tation  of  the  entire  S-structure  construction  is  permissible  iff  the  quantified 
formula  fi  is  true.i  | 

Note  that  the  binding  constructions  of  this  proof  can  also  be  used  to  ■'ive 
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more  than  four-ways  ambiguous. 

In  our  examination  of  transformational  theories  of  syntax,  it  was  not  clear 
how  the  complexity  of  the  lexical  resolution  problem  reflected  on  the  com¬ 
plexity  of  the  more  general  language  comprehension  problem  because  it  was 
conceivable,  although  unlikely,  that  the  phonological  form  corresponding  to 
the  complex  S-structure  of  the  reduction  coulu  also  correspond  to  a  simple 
S-structure.  If  that  were  the  case,  the  hearer  couli  a l  way  the  simpler 

S-structure,  the  difficult  instances  of  the  LRP  would  not  arise  in  practice, 
and  the  complexity  of  the  LRP  would  not  directly  bear  on  the  conipicxh v 
of  language  perception. 

But  in  the  preceding  proof,  and  in  the  proof  that  the  S-structure  binding 
decision  problem  ir  Np-hard,  the  phrase  structure  constructions  are  trivial: 
there  is  no  movement,  all  positions  are  strongly  identified,  and  all  features 
are  explicit  on  the  lexical  items.  The  lack  of  ambiguity  in  and  the  directness 
of  the  PF  to  S-structure  mapping  for  these  constructions  provides  extremely 
powerful  evidence  that  language  perception  is  intractable. 

A  central  consequence  of  the  PSPACE-hard  result  is  that  linguistic  rep 
resentations  in  the  T-model  do  not  have  efficient  witnesses,  unless  NP  = 
PSPACE.  In  another  words,  it  is  not  possible  to  efficiently  determine  the 
grammaticality  of  a  complete  linguistic  representation  (in  our  case,  a  com¬ 
pletely  specified  D-structure,  S-structure,  LF,  and  PF).  Therefore,  using 
linguistic  knowledge  in  language  comprehension  can  be  intractable.  (The 
complexity  of  language  comprehension  is  measured  relative  to  the  size  of 
the  phonological  form  or  surface  structure.) 

Theorem  8  Using  linguistic  knowledge  is  PS  PACE-hard. 

Proof.  A  corollary  to  theorem  7.  given  the  triviality  and  directness  of 
the  PF  to  S-structure  mapping  (hearing)  and  its  inverse  (speaking)  for  the 
VP  deletion  Tinking  ambiguity  construction  of  theorem  T.j  | 


